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A New Carrier System for Rural Service 


By R. C BOYD, J. D. HOWARD, JR, and L. PEDERSEN 


(Manuscript received July 19, 1956) 


A study of the problem of providing telephone service to rural customers 
indicated the need for a flexible carrier system that could be used economically 
on new and existing rural cable and open wire lines. The desire for low cost 
required new approaches to almost every phase of the carrier system design 
for rural service, which has been designated Type P1. 

Use of transistors led to sweeping changes in the detailed circuitry and 
also created demand for other new components. Mounting and intercon- 
necting the circuit components by means of printed wiring boards empha- 
sized the necessity for close coordination between design and manufacturing 
objectives. The low power-drain requirements of wansistor circuitry were 
supplied economically by the use of similar solid state devices, a new storage 
battery, and efficient packaging. 

A fast, accurate and simple method has been evolved for applying the P1 
carrier system to rural lines with a minimum of line treatment or rearrange- 
ment. Plug-in equipment, readily accessible test points, and a carrier test 
set provide the ease of maintenance needed in the use of telephone equipment 
at remote locations. Use of the P1 carrier system will extend the application 
of electronic equipment outside of the telephone central office and provide 
a carrier system whose performance will be consistent with requirements 
for high quality communication service at low cost. 
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1. INTRODUCTION 


Although carrier has been used successfully to provide trunks in the 
Bell System for more than 35 years, it has not been economically feasible, 
up to the present time, to apply carrier telephone techniques extensively 
to the rural telephone plant. The technical and economic problems as- 
sociated with providing telephone service to customers in rural areas has 
long been one of the most difficult problems facing the telephone indus- 
try. The widely scattered locations of customers in rural areas have led 
to a large number of rural telephone routes with only a few customer 
lines per route. This has precluded the use of large cables on any one 
route, which would be economically attractive in urban areas. The ex- 
tensive use of carrier has not been feasible because the distances from 
the rural customers to the Central Office are in the 5- to 20-mile range 
in which carrier has not been generally economical in the past. 

The two lines of attack which were taken on this problem were to 
reduce the cost of telephone plant through less expensive small cables 
and open-wire plant,! and to provide an economically attractive carrier 
system designed to meet the particular needs of rural telephone service. 

This paper discusses the broad objectives for a rural customer carrier 
system, the major parameters of the Pl system which was developed to 
meet those objectives, and its circuit, equipment, and power arrange- 
ments. It also covers the engineering and maintenance methods to be 
used by the Bell System Operating Companies to install and operate 
the system.’ 


2. BROAD OBJECTIVES FOR Pl CARRIER SYSTEM 


The broad objectives for the Type P1 carrier system resulted from 
the stringent economic limits imposed on the system to enable it to prove 
in over conventional rural plant of the latest and most economical de- 
sign, from the requirements of rural telephone transmission and signaling, 
and from Bell System experience gained with earlier carrier systems for 
customer and trunk use. The low cost objective for this system also im- 
plied the need to achieve an appropriate balance among an economic 
first cost of equipment, low in-place cost due to simplified engineering 
and installation practices, and accompanying low annual costs due in 
part to simplified system maintenance. 

To achieve an economic carrier system for rural telephone use, the de 
power requirements of the terminals and repeaters had to be kept low. 


1 Lester Hochgraf and R. G. Watling, Telephone Lines for Rural Subscriber 
Service, A.I.E... Communication and Electronics, No. 18, p. 171, May, 1955. 

2 These aspects of the Pl system are covered in more detail in four papers on 
“The Pl Carrier System.’”’ A.I.E.£. Communication and Electronics, No. 24, 
pp. 188, 191, 195, 205, May, 1956. 


4 
| 


THE TYPE Pl CARRIER SYSTEM 


This was especially important since previous Bell System experience 
indicated that where commercial power is used to supply the system, 
some form of reserve must be provided, and where commercial power is 
not available, the use of primary batteries places a premium on mini- 
mizing the power required. 

From these considerations two additional major objectives were de- 
rived: low manufacturing costs for the components and assembled equip- 
ment, and the use of transistors to minimize power supply drains. In 
addition, flexibility was needed in the proposed carrier system because 
of the difficulty of accurately forecasting the demand for rural service. 

These objectives have been met in the design of the Type P1 rural 
customer telephone system. It is a fully-transistorized system con- 
sisting of independent two-way carrier channels applicable in increments 
-of one to four at a time in the frequency band above the regular voice 
frequency circuit. Each channel uses a terminal at the central office and 
at a remote point with intermediate repeaters as necessary. Between 
terminals, the system is equivalent to a rural voice frequency line with no 
changes required in the central office or rural customer equipment. Be- 
yond the outlying terminal, distribution is by voice frequency wire on 
a single or multiparty basis. The system can be applied to existing and 
new lines utilizing combinations of fine gauge exchange cable and copper 


or steel open-wire. Systems can be used on each of several pairs on a 
given pole line, the number depending on the line characteristics. 


3. MAJOR PARAMETERS OF Pl CARRIER SYSTEM 


This section summarizes the important features incorporated in the 
P1 carrier system and the reasons governing their choice. The system 
has a number of features in common with Bell System toll carrier sys- 
tems, but it also differs in several important aspects because of specific 
rural requirements. One aspect is the signaling, which requires different 
arrangements at the two ends of the circuit because of the widely differ- 
ent signals carried in the two directions. Another is that the remote ter- 
minals of the individual channels are usually distributed along the line 
rather than grouped at a common location. 


3.1. Transmission Plan 


It is difficult, to divorce the considerations leading to the choice of 
carrier frequency range from those affecting the choice of modulation 
in the carrier system. Studies of growth on rural lines indicated that a 
system giving three or four channels (customer circuits) on one pair of 
wires, in addition to the physical circuit, should be sufficient if systems 
could be applied to each of several pairs on a given open-wire line. 
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The blocking out of the frequency range was controlled by a number 
of factors. Cost considerations required that the carrier frequencies be 
kept above the voice frequency range. If carrier extended into the voice 
range, the voice frequency circuit would be lost on a carrier pair. One 
of the carrier channels applied to that pair would have to be used to re- 
place it. Thus, the addition of four carrier channels to a pair would yield 
a net gain of only three channels. This in turn would increase the net 
cost per gained channel. Filter costs determined how close to the voice 
frequency band the carrier frequency range could be placed and in con- 
junction with the number of channels required how closely the channels 
could be placed to each other. 

Crosstalk considerations restricted the carrier frequency range to below 
about 100 ke in order to reduce the cost of line treatment and rearrange- 
ment of pairs on existing rural lines. By using this frequency range it 
appeared possible to apply more than one carrier system to crossarms on 
an open-wire route. The rapid rise in attenuation with frequency of steel 
wire used on rural lines dictated that the range of frequencies be kept low. 
As a result of these two sets of considerations, development work on 
the P1 carrier system was concentrated in the 8- to 100-ke range. 

Amplitude modulation of the carrier frequencies was chosen over 
other forms of modulation because of the simpler terminal circuitry 
and equipment and because of the saving in bandwidth. Use of ampli- 
tude modulation and the use of compandors, discussed in a later section, 
were felt to compensate for possible transmission advantages that could 
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be obtained by using angular modulation (frequency or phase) with a 
large modulation coefficient. 

Cost was again a major factor in the choice between double sideband 
and single sideband amplitude modulation. Past experience with other 
carrier systems has indicated that filters are a major part of the cost of a 
system and, when frequency space is available, double sideband filters 
are, in general, less expensive than those for single sideband. In addition, 
the cost of a single sideband system would be increased because of the 
problem of obtaining the necessary carrier supply at the terminal. 

The frequency plan developed for the PI carrier system is shown in 
Fig. 1. The unusually wide carrier spacing of 12 ke was adopted in order 
to minimize filter costs. Since the remote terminals are generally distribu- 
ted along the line, it was not practical to use double modulation to ac- 
complish filtering in the most efficient frequency range. Instead, filtering 
was done at line frequencies. Every effort was made to achieve channel 
filter designs with maximum efficiency of element utilization. Advantage 
was taken of the more leisurely rising characteristics of the double side- 
band filters permitted by the wide frequency spacing. 

The stackable frequency arrangement was provided for non-repeatered 
operation, because when the lowest two carrier frequencies are used to 
provide a channel, it can be used over substantially longer distances than 
channels using higher frequencies. The grouped arrangements were pro- 
vided for repeatered systems to reduce the cost and number of the re- 
peater filters and amplifers needed to separate the two directions of 
transmission. The staggered grouped arrangement can be used with the 
normal grouped arrangement on a pole line having poor crosstalk coup- 
ling in order to increase the effective coupling loss between carrier 
channels on different pairs. The grouped and stackable arrangements 
cannot be used on the same pole line, because certain frequencies would 
be used for both directions of transmission. This would produce large 
differences between transmitted and received carrier power at terminals 
and repeaters which would lead to intolerable crosstalk. 

A number of terminal arrangements were studied in order to implement 
the above frequency plan. The arrangement for a remote terminal shown 
in Fig. 2 was chosen as the simplest terminal meeting all of the system 
requirements. It is very similar to the channel terminal arrangement 
used in the Type N1 carrier system, another double sideband amplitude 
modulation system used for long distance trunks of the Bell System. The 
several shaded portions in the figure show the breakdown of the terminal 
functions into individual sub-units, which are the basis for the equipment 
arrangements discussed in Section 5 of this paper. A number of the other 
important features that make up the terminal arrangement are discussed 
in the following sections. 
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THE TYPE Pl CARRIER SYSTEM 
Use of Transistors 


Transistors were chosen for use in the P1 system because they are low 
voltage, low power devices as compared to electron tubes suitable for 
transmission circuitry. Also, transistors are expected to be lower in cost 
and inherently longer life devices than electron tubes, thus contributing 
to reduced initial and operating costs. 


The de power requirements for the P1 system, using transistors, may 
be compared to those for a channel terminal in the Type NI system as 
an indication of the de power saving that has been achieved with the 
Pl system. A transistorized P1 terminal requires about 1.2 watts while 
it is in operation compared to 40 watts required for an N1 terminal, 


Fig. 3. — PI transistor transmitting amplifier circuit. 


which represents a substantial power reduction achieved by the use of 
transistors. Because part of the Pl terminal is turned off during idle 
periods, the average power required over a day is about 0.9 watt. 

During the development of the Pl terminals it was found that a single 
design of a transistor amplifier could be used in several different places. 
These included the compressor and expandor amplifiers, the transmitting 
amplifier and the input portion of the receiving amplifier. The circuit 
for that amplifier is shown in Fig. 3. 

The amplifier uses Western Electric NPN grown junction type tran- 
sistors coded 4B for the voice frequency amplifiers and 4C transistors for 
the carrier amplifiers. The first transistor is connected as a common col- 
lector and the second as a common emitter. By using them in this man- 
ner it is possible to employ the same type of transistor in both stages. 
Feedback is obtained by using hybrid coils at both the input and output 
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of the circuit in much the same manner as for electron tube circuits. One 
significant difference is that in this transistor circuit only the second 
transistor introduces a 180-degree phase shift. This permits both input 
and output coils to return to a common ground asina three-tube electron- 
tube circuit and thereby avoids the circuit complications of a two-tube 
circuit where one of the coils must float off ground. A simple resistance 
interstage is used and battery filtering completes the circuit. 


3.3 Low Voltage Protection 


Use of transistors gave rise to the need for supplementary protection 
from voltage surges on the line below those for which conventional carbon 
blocks afford protection. This additional protection was obtained by 
using the reverse voltage breakdown characteristics of newly developed 
silicon-aluminum junction diodes as shown in Fig. 4. Protection is pro- 
vided in the 50- to 1,000-volt range by the diodes and above a nominal 
value of 750 volts by the carbon blocks. During the normally short period 
of operation the small diodes carry a current of up to 10 amperes. 


3.4 System Levels and Carrier Line Loss 


The carrier frequency output power of the transistorized transmitting 
amplifier in the terminals was set at +6 dbm. This level was limited 
primarily by the power handling capabilities of the transistors used. Be- 
cause of the loss of the secondary protection circuitry, band filters, and 
the line transformer, this beeame +4 dbm at the carrier line terminals. 
This is equal to the highest carrier power transmitted by the Type N1 
‘arrier system. With 50 per cent modulation of the carrier, the effective 
sideband level at the transmitting line terminals is only 2 db below that 
transmitted by the Type O carrier system, the most recent carrier system 
used for open-wire long distance trunks of the Bell System. 
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Fig. 4 — Secondary protection circuit in carrier portion of P1 carrier terminal. 
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The +4 dbm output level coupled with noise and crosstalk considera- 
tions indicated that 30-db bare carrier line loss between terminals would 
be possible. A survey of existing and planned Bell System rural telephone 
lines indicated that substantial amounts of entrance cable and open wire 
would be encountered in potential carrier layouts. Calculations of carrier 
frequency loss of those facilities showed that 30-db loss would not be 
sufficient to care for all of the necessary rural applications, which con- 
firmed the need for carrier repeaters. 


3.5 Compandors 


Compandors were incorporated in the Pl system because their several 
advantages more than offset their added cost. The crosstalk and noise 
advantage provided by their use reduced the need for expensive line 
treatment to reduce crosstalk. In addition, the compandor noise advan- 
tage permitted lower received carrier levels to be used, thus increasing 
the permissible carrier line loss. Compandors also eased the requirements 
on terminal and repeater filters, thus reducing filter cost. 

The compandor in the P1 system is a simplified version of the syllabic 
compandor used in Type N1 and O carrier systems, but its performance 
is comparable to those units. The new problem of matching the com- 
pressor and expandor characteristics in Pl terminals operating in differ- 
ent ambient temperatures has been simplified by the use of silicon-alumi- 
num junction diodes in the compandor variolossers and control circuits. 


3.6 Channel Regulation 


Channel regulation was necessary to provide satisfactory transmission 
performance and keep maintenance adjustments to a minimum. The 
regulation was designed to compensate for daily and seasonal carrier 
circuit net loss variations caused by changes in line attenuation with 
temperature. It would be desirable to have the terminal regulation range 
equal to 30 db, the maximum line loss that can be spanned between the 
terminals, to ease engineering layout considerations. However, cost con- 
siderations led to a 15-db range, with span pads used where required by 
system layout to adjust the received carrier power to the center of the 
range of the regulator. 

The regulation in the receiving amplifier is of the backward-acting 
type. A reference control signal, derived from the receiving amplifier 
output, is used to vary the loss of the balanced diode variolosser at the 
amplifier input. The regulator ‘‘stiffness’”’ of 1.6-db change in channel 
voice frequency output for 15-db variation in carrier input is obtained 
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by the combination of the rectified control signal voltage exceeding the 
reference voltage of a silicon-aluminum junction diode and by the ex- 
pansion characteristic of the variolosser. The variolosser uses a specially 
coded set of the silicon diodes which are matched for both ae and de 
characteristics. Modulation products introduced by the variolosser have 
been kept below those produced in the associated receiving demodulator. 


3.7 Signaling 


The need to transmit customer signaling information over a carrier 
channel required the development of means of passing dialing and super- 
vision signals toward the central office. It also required passing ringing 
information to the remote terminal for the types of multiparty ringing 
generally used in the Bell System, including four-party selective service, 
eight-party semi-selective service and divided code ringing. 

These requirements were met in such a way that the carrier system 
can be inserted into a normal voice frequency circuit and function with- 
out requiring any change in the existing signaling equipment in 
the central office or in the customer’s telephone. The central office ter- 
minal is activated by 20-cycle ringing signals which are reproduced at 
the remote terminal. The remote terminal is activated by switchhook 
signals and dial pulses which are reproduced at the central office termi- 
nals. Thus, the two directions of signaling require completely different 
circuits. A block schematic of the arrangement used is shown in Fig. 5. 


3.7.1 Ringing 


The customer signaling originating in Bell System central offices con- 
sists of 20 cycles superimposed on plus or minus battery and applied 
between either tip or ring and ground. These signals control the trans- 
mission over the P1 carrier system of three in-band frequency tones, the 
proper combination of two of them serving to select the party to be rung 
from the far end. The third tone (2,500 cycles), modulated at a 20-cycle 
rate, carries the information as to whether 20-cycle ringing is present or 
absent. In-band frequencies were chosen to encode ringing information 
for transmission over the carrier channel because of the substantially 
lower cost of in-band filters as compared to those required for out-of-band 
transmission. 

The three signaling tones are generated by three transistor tone oscil- 
lators incorporated in a P1 central office terminal. One set of three oscil- 
lators can be arranged to supply four central office channel terminals. 

The transmission of the tones is controlled by three diode-operated 
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keyers which function independently, depending on the nature of the 
ringing signal. The 2,500-cycle keyer responds to 20 cycles applied to 
either the tip or ring conductors. The gating diode is back biased about 
3 volts to prevent random noise peaks from operating it. The 1,750-cycle 
keyer responds to any ringing signal applied to the tip circuit. A diode 
oppositely poled to the gating diode has a high breakdown so that any 
reverse voltage peaks leaking through to it will not open the gate. The 
1,150-cycle keyer responds to 20-cycle ringing voltage superimposed with 
either plus bias voltage applied to the tip or with minus bias voltage 
applied to the ring conductor. 

At the remote terminal the signaling tones are each selected at the 
output of the expandor by tuned circuits, amplified by a transistor and 
rectified to activate relays controlling the customer ringing. The 2,500- 
cycle tone interrupted at a 20-cycle rate controls the remote ringing gen- 
erator, the 1,750-cycle tone determines whether ringing is to be on the 
tip or ring side of the line, and the 1,150-cycle tone whether the bias ap- 
plied to the line is positive or negative. 

The ringing power at the remote terminal is provided by a 3,000-cycle 
transistor oscillator which uses a 2-watt 6A transistor in the second of 
its two stages, as shown in Fig. 6. The rectified output of the oscillator 
is pulsed at a 20-cycle rate by a relay controlled by the 2,500-cycle in- 
band tone and applied to the line through a low-pass filter to reduce the 
harmonic content of the ringing signal. Positive or negative bias is pro- 
vided from one of two clamper diodes. In order to obtain sufficient power 
from the ringing generator, two electrolytic capacitors are used in a volt- 
age doubler configuration. The ringing generator draws nearly 500 mils 
of battery current which, by P1 standards, is a heavy power drain. In 
order to keep this drain to a minimum, the ringing generator is activated 
only during the ringing period. Also the generator is connected to the 
customer’s line only during the ringing period to remove its shunting 
effect on the talking circuit. 


3.7.2 Supervision and Dialing 


Carrier on-off signaling was chosen to transmit supervision and dialing 
signals from the customer to the central office. This method was used 
because of the ease of implementing it and because of savings in de power 
drain at the remote terminal achieved by transmitting the carrier from 
that terminal to the central office only during the off-hook condition. 

An off-hook signal on the voice frequency extension of the remote ter- 
minal, caused by the customer lifting his handset, is used at the remote 
terminal to activate the transmitting amplifier and to remove a short- 
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circuit from its input. This results in carrier being transmitted to the 
central office terminal, where it causes relays in the terminal to recreate 
the customer’s line short across the line connecting the central office 
carrier terminal and the central office switching equipment. 

Dialing at the customer’s instrument alternately opens and shorts 
the voice frequency extension at each dial pulse. Opening of the line 
operates a relay in the remote terminal which short-circuits the transmit- 
ting amplifier input and causes the relays in the central office to follow 
the pulsing of the received carrier, recreate the dial pulses there, and 
operate the central office switching equipment. 


3.8 Repeater 


Repeaters are used in the Pl system whenever the line transmission 
loss exceeds the maximum that the terminals can accommodate. The 
repeaters use transistors for gain instead of electron tubes, but otherwise 
are schematically very much like previous repeaters as shown in Fig. 7. 
The two directions of transmission have similar functions differing only 
in the frequency band that is amplified and the options that are used in 
the gain regulating circuits. 

Identical high-low pass directional filters at each end of the repeater 
separate the directions of transmission into the high and low frequency 
groups. Optional phase correction sections for these directional filters are 
used along a line to improve the phase characteristics in the cut-apart 
region. The directional filters are connected to the line through the same 
line matching coils and secondary protection circuits used in the termi- 
nals. Repeater input span pads are used to build out the line loss to its 
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Fig. 8 — Pl repeater gain-frequency characteristic. 


proper value for the nonregulated repeater and to adjust the power of 
the received carriers to the center of the regulator range for a regulated 
repeater. The output span pads are used where it is necessary to adjust 
the repeater output power in order to equalize levels between a PI sys- 
tem and other P1 systems or other types of carrier systems operating on 
the same open wire line. These pads provide attenuation in 2 db steps 
up to 30 db. 

The repeater amplifiers were designed to have a wide enough frequency 
band to cover both high and low groups of frequencies, so that each re- 
peater contains two identical amplifiers. Each amplifier has three transis- 
tors with each stage connected as a common emitter. Western Electric 
Company PNP 7B and 6B transistors are used in the first and last stages, 
respectively, and a NPN type 4C transistor is used in the second stage. 
Local feedback is required around each transistor to reduce the gain 
spread and phase variations among units. Overall feedback is obtained 
around the three transistors with hybrid coils at input and output. 

The repeater equalizer characteristic represents a compromise for 
several types of transmission facilities generally encountered in the rural 
plant. The equalizer design also covers both the high and low frequency 
groups so that identical equalizers are used for both the high group and 
low group sides of the repeater. 

A preliminary characteristic for the overall repeater gain is shown in 
Fig. 8 plotted against the design objective. There is a significant depar- 
ture in shape only at the cut-apart frequencies. This will be corrected 
sufficiently to permit as many as four repeaters to be used in tandem. 
As the design objective is a compromise of the loss of several types of 
lines that may be encountered in the use of this system, the departures 
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Fig. 9 — P1 Repeater amplifier gain-frequency characteristic. 


in system performance may vary considerably from that shown. The 
repeater amplifier gain frequency characteristic, plotted in Fig. 9, shows 
a non-regenerative peak gain of the three-stage transistor amplifier of 
80 db and a feedback gain characteristic of 50 db. This provides 20 db 
or more of feedback over the transmitted band to produce the necessary 
operating stability with temperature and power supply variations, and 
a working value of modulation suppression. 


3.9 Repeater Regulation 


Repeater regulation will be furnished as an option where variations 
in line loss exceed the terminal regulating range. It will usually be neces- 
sary on systems employing more than one repeater in order to control 
noise performance. Repeater regulation in the direction of transmission 
from central office to remote terminal is controlled by the total carrier 
power of the channels working on one system. In the opposite direction, 
the repeaters will regulate on a low level carrier frequency pilot because 
the channel carriers are not always present in that direction of transmis- 
sion due to their signaling function. The pilot frequencies are shown in 
Fig. 1. 

The repeater regulator, shown in schematic form in Fig. 10, functions 
in much the same manner as the terminal regulator. The principal dif- 
ferences between the two regulators arise from the requirement that 
interchannel modulation must be appreciably less than 1 per cent in the 
repeater. To limit the contribution of the repeater regulator to a small 
value, the variolosser operates into a lower impedance and at a higher 
control current than used in the channel regulator. 
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The input section to the control amplifier is either a flat bridging pad 
for the case where all carriers are always present on the line or a pilot 
pick-off filter and its associated single transistor amplifier where carriers 
are turned on and off for supervision. The latter extra amplifier is neces- 
sary because the pilot power is 20 db below the power in each normal 
carrier. 

The regulator stiffness provided by the repeater regulator results in 
a variation of 1 db in output carrier power for a 10-db variation in re- 
ceived total carrier or pilot power. 


4. COMPONENTS 


Development of the passive components of the P1 carrier system, 
including the various filters and other networks, were influenced by 
three major considerations. The manufacturing cost had to be as low as 
possible consistent with the traditional standards of Bell System service 
life. The components had to lend themselves to maximum utilization of 
the printed wiring techniques to be used as the basic equipment method. 
And lastly, advantage wherever possible was to be taken of the fact that 
transistors are low power devices. 


4.1 Filters 


Component-wise, filters are the most important single assembly de- 
termining the first cost of a carrier system employing frequency division 
multiplexing and frequency separation for obtaining equivalent four-wire 
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Fig. 10 — P1 Repeater regulator block schematic. 
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Fig. 11 — Miniaturized inductor for carrier. 


operation on the line. Past experience has shown that one-quarter or 
more of the first cost is often chargeable to the various filter. The decision 
to employ double sideband modulation was largely based on the knowl- 
edge that when frequency space is available the double sideband channel 
filters are generally the least expensive. 

With this decision made, every effort was directed toward the achieve- 
ment of channel filter designs of maximum efficiency in element. utiliza- 
tion. Inexpensive wide-limit capacitors were used, and the desired per- 
formance achieved through the use of an adjustable ferrite inductor 
expressly developed for the P1 system. The filters are rapidly adjusted in 
the manufacturing process using visual display testing circuits. 


4.2 Inductors 


The inductor which makes this possible is shown in Fig. 11. It is de- 
signed for printed wiring use and provides a wide range of inductance 
while maintaining excellent “Q’’ performance in the carrier and voice 
range. This is accomplished in a single basic design by so selecting the 
winding for particular nominal inductances that the air-gap adjustment 
remains at or near its most efficient setting. Inductors of this type were 
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used not only in all the channel, demodulator, and signaling filters in 
the terminal and in the directional filters at the repeater, but also in the 
channel oscillators and other parts of the circuitry where an inexpensive, 
adjustable element offered manufacturing or service advantages. 


4.3 Capacitors 


Most of the wide-limit capacitors used in the filters are of the com- 
mercially available molded mica type. Where the capacitance values 
would require large and expensive mica units both in filters and other parts 
of the circuit, newly available foil-Mylar capacitors were used. These take 
the form of very small pigtail units in a range of physical sizes similar 
to those of the solid tantalum capacitors described below. The Mylar 
‘apacitors have low working voltages in these miniature sizes and can 
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Fig. 12 — Prototype solid tantalum capacitors for Pl earrier. 
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be employed because of the low voltage protection provided for the tran- 
sistorized circuits. Both cost and space savings are realized in these ca- 
pacitors since no cans or potting are required due to the stability of the 
Mylar dielectric under moisture exposure. 

Another new type of capacitor has found widespread use in the PI 
system. This is a solid tantalum electrolytic capacitor used in place of 
the usual paste or liquid electrolytic capacitor. The solid electrolyte is 
manganese dioxide deposited upon the capacitor surfaces. The anode is 
made from tantalum metal and upon its surfaces is deposited the tanta- 
lum oxide which forms the dielectric. The cathode is an enveloping metal 
completing the capacitor structure. This new design of capacitor is now 
available in values up to 100 microfarads in a very small volume. It is 
expected to be less expensive than other electrolytic capacitors while at 
the same time providing a rugged structure which is relatively inert elec- 
trochemically and which has better stability in operation and storage. 
Fig. 12 shows prototype models of typical solid tantalum units. 


4.4 Transformers 


Transformer needs in the Pl system are met by two miniature struc- 
tures which were made possible by the use of low power transistor cir- 
cuits. The carrier frequency units employ a manganese zinc ferrite core, a 


spool winding and wire terminals which permit assembly on printed 


Fig. 13 — Carrier and voice frequency transformers for P1 carrier. 
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Fig. 14 — Example of P1 carrier line network. 


wiring boards. They are potted with an asphalt compound in a eylindri- 
cal aluminum can. The voice frequency transformers are wound on lam- 
inated core structures of permalloy. The units are potted in an epoxy 
resin in rectangular aluminum cans. The terminal plate carrying the 
wire terminals for mounting is a cast unit of a styrene polyester. Both 
types of transformer are shown in Fig. 13. 


4.5 Line Networks and Filters 


Also deserving mention is a new series of line networks and 
filters (which do not form part of either the terminal or repeater equip- 
ment) with specific functions described in Section 7. All of the networks 
have been designed with the same type mounting arrangement shown 
in Fig. 14 with two sizes used depending on the number of components 
housed. The networks are cast in a styrene polyester. High voltage pro- 
tection is self-contained and sturdy terminals are provided for bridle 
wire connection. By means of side slots in the casting the network is 
mounted on a wedge-shaped holder which is fastened to the crossarm or 
pole. A flexible rubber cover is snapped over the face of the network to 


protect against weather effects. 


5.EQUIPMENT ARRANGEMENTS 


The emphasis placed on economy in this development project made 
it necessary to consider a number of different approaches before deciding 
on the physical arrangement provided for both central office and pole 
mounted equipment. At both locations the terminals for each channel 
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BOG NET 


Fig. 15 — Front and back of typical printed wiring board. 


were treated on an independent basis, thus providing maximum flexibil- 
ity in application. While the use of transistors made it possible to take 
advantage of miniaturized components, the major emphasis in design 
has not been on miniaturization; instead it has been to achieve low manu- 
facturing costs, simplicity of engineering and installation, and a mini- 
mum of maintenance effort. The recent trend toward automation in 
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manufacture of electronic equipment has also influenced the design to a 
great extent. 


5.1 Printed Wiring Boards 


To best meet these objectives, use has been made of plug-in units which 
have proved successful in other carrier systems, such as the N1 and O. 
The assembly technique used here, however, is an entirely new approach 


for carrier equipment in that the plug-in unit consists of a printed wiring 
board on which all components are mounted. Printed wiring, which is a 
comparatively new engineering technique, was selected because of its 
applicability to automatic assembly, including mass soldering of con- 
nections. In addition, the use of printed wiring greatly simplifies testing 
and inspection and assures a more uniform product. The two sides of a 
typical printed wiring board are shown in Fig. 15. 


5.2 Interconnection of Boards 


The interconnection of the various plug-in units or printed wiring 
boards, required to make up a complete Pl terminal or repeater, is ac- 
complished by means of a wire connector specifically developed for this 
project. Basically, the connector consists of a number of accu- 
rately spaced bare wires running parallel to each other and imbedded 
in cross member strips of insulating plastic material. At fixed intervals 
the wires are exposed, and this is where contact is made to terminal con- 
nectors mounted on the printed wiring boards. These terminal connee- 
tors, shown in Fig. 16, are made of spring tempered phosphor bronze 


- Closeup of terminal connectors, 
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and consist of bifurcated cantilever springs, providing a total of four 
contacts for each connection. 

As can be seen in Fig. 17, the wire connector is actually a molded phe- 
nolie box into which are inserted all of the printed wiring boards that 
make up a complete terminal or repeater. The terminal connectors on 
the boards thus engage the wires that are imbedded in the back of the 
connector. To insure contact reliability, a finish of precious metal is pro- 
vided on both the wires of the connector and the terminal connectors on 
the board. Additional flexibility in the interconnection of the boards is 


Fig. 17 — Prototype model of connector Fig. 18 — Typical terminal networks 
box unequipped showing grid wires. mounted in a prototype connector box. 
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obtained by cutting the wires at various points by simply drilling holes 
in the phenol structure supporting the wire. 


5.3 Terminal and Repeater Mounting 

A complete terminal ready for installation at a remote location is shown 
in Fig. 18. The top position in the connector is shown vacant. This is 
where the connections to line and power supply are made by means of 
another plug-in printed wiring board with attached flexible wiring for 
the external connections. 


Fig. 19 — P1 carrier remote terminal in pole mounted cabinet. 
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The equipment described here is equally adaptable to central office 
mounting and pole mounting at remote locations. At the remote loca- 
tions, however, it is necessary to provide the equipment with an outer 
housing which gives protection from all kinds of weather and even from 
moisture condensation. The opened housing is shown in Fig. 19. Fig. 20 
shows a typical remote mounting of the housing on the left. In previous 
electron tube carrier systems the amount of heat generated by the equip- 
ment itself was sufficient to prevent moisture condensation. In the case 


Fig. 20 — Example of remote mounting of P1 terminal and ae power supply. 
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of the P1 carrier, however, the heat dissipation during the idle period is 
less than 1 watt for the entire terminal. To prevent condensation, the 
housing or apparatus case is sealed by means of a neoprene gasket. To 
further reduce the moisture content of the trapped air, the use of a desic- 
‘ant is specified. The apparatus case is made of die-cast aluminum with 
the outside walls finished in white enamel to keep heat absorption to a 
minimum. The system was designed to operate between temperature 
extremes of —40°F and +140°F. This limitation might necessitate the 
additional installation of sun shields in a few cases where extreme temper- 
atures prevail. 

The terminal equipment at the central office makes use of the same 
type of printed wiring boards plugged into a connector as used at the 
remote location. In the central office, however, the outer housing is dis- 
pensed with and the connector is mounted on mounting brackets 
on standard relay racks. The relay rack layouts can be arranged in a 
number of ways to suit the particular installation, since no shop wired 
bays are used. A typical 11’6” relay rack layout will provide for 10 ter- 
minals. No line jacks or alarm features are provided and fusing may be 
obtained from existing fuse boards in the office. The equipment also 
lends itself to wall mounting in locations where relay rack space is not 
available. 


5.4 Testing and Maintenance Features 


One great advantage of the equipment design used in the P1 carrier 
system is the ease with which an entire terminal or repeater can be trans- 
ported to, and installed at, a remote location. In case of trouble, the 
entire equipment unit, be it a terminal or a repeater, can be readily re- 
placed. It is not expected that the maintenance man will attempt to 
replace an individual printed board at a remote location; however, this 
procedure is perfectly feasible in a central office. To facilitate the loca- 
tion of trouble in a unit, the various boards are provided with test points 
located at the outer end of the boards so as to be easily accessible to the 
maintenance man. 

Certain precautions will have to be taken at central repair centers in 
replacing defective individual components in order not to damage the 
printed wiring. Too much heat applied by a large soldering iron will de- 
stroy the adhesive bond between the copper conductor and the phenolic 
board, but repair can be made under certain controlled conditions. A 
limited amount of wiring modifications can also be made to the printed 
wiring by inserting strap wires in place of components. 
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6. POWER SUPPLIES 


The design of a carrier system with low power drain made possible the 
development of a low-cost, reliable de power supply for the carrier equip- 
ment. Because the central office carrier terminal was designed to utilize 
standard central office voltages (24 or 48 volts), only the power supply 
for the remote equipment will be described here. 

Early exploratory studies showed that conventional power supply 
designs would miss the first and annual cost objective by an uncomforta- 


ble margin. A number of unconventional approaches were studied: 

(a) Storage batteries charged over the carrier line. 

(b) Storage batteries placed in service with full charge and removed 
to a central point for recharging. 

(c) Solar power plants. 

(d) Wind power plants. 

(e) Thermoelectric power plants. 

(f) Dry cells. 

In all of the above cases the power plant was either too costly, too 
large, or technically unfeasible, and none could prove in over the con- 
ventional conversion of ac to de where commercial power is available. 
This was true despite need for a storage battery to operate the system 
during ac power failure intervals and to provide peak ringing power. 


6.1 AC Rectifiier-Storage Battery Plant 


The basic elements of the power plant circuit, as shown in Fig. 21, are 
the conversion section represented by the step-down transformer T1 and 
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. 21 — Schematic of ac rectifier-battery power supply. 
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the semiconductor rectifier bridge CR1, the voltage control circuit rep- 
resented by that part of Fig. 21 enclosed in dashed lines, and the energy 
storage circuit represented by the battery. 

Rectification is obtained with germanium rectifiers that are very effi- 
cient, have long life with negligible aging, and are very compact phys- 
ically. The output of this rectifier is not constant, because the output 
voltage will vary with the ac input voltage and the de load current drawn 
by the carrier terminal. Thus a regulating circuit must be provided. 

The regulating network senses the voltage across the battery and 
compares this voltage to a reference obtained from a silicon junction 
diode biased in the reverse direction.’ Any error in the output voltage 
is converted to a current signal in the first amplifier stage and amplified 
by the second stage transistor Q2. The amplified error current is then 
used to control the impedance of transistor QL which acts as a current 
shunt around the battery. 

The fundamentals of the operation of this regulating system are shown 
in Fig. 22. If the load voltage is too high, the network adjusts the re- 
sistance of transistor Q1 so that some of the rectifier output current is 
shunted around the load. The load voltage will then return very quickly 
to the regulated value. Because the rectifier circuit must not be over- 
loaded by a discharged battery, some form of current limiting must be 
provided; this is automatically taken care of by resistor R1. The rectifier 
is capable of supplying indefinitely the current that would be drawn to 
charge a battery after a very long power failure. 

The storage battery is shown in Fig. 23 near the bottom of the power 
plant housing. It is a new design with a h?gh specific gravity sulphuric 


*D. H. Smith, Silicon Alloy Junction Diode as a Reference Standard, A.L.E.E., 
Communication and Electronics, No. 16, pp. 645-651, Jan. 1955. 


- Simplified schematic and regulation characteristic of ac power supply. 
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Fig. 23 — PI carrier ac power plant in cabinet for pole mounting. 


acid electrolyte for good low temperature operation and lead calcium 
alloy electrodes for long life. The battery has 10 cells housed in two jars 
of five cells each. It is operated at about 23.5 volts and weighs about 
ten pounds. It provides about six days’ reserve for a remote terminal or 
about two days for a remote repeater. The battery should not freeze at 
temperatures as low as 40°F, but the storage capacity may be reduced 
90 per cent at this extreme. The battery is mounted on steps so that the 
electrolyte level can be seen through the transparent plastic battery 
jars. Fig. 23 also shows the compact packaging of the entire power supply 
within the same type of aluminum housing as used for the carrier termi- 
nal. A typical pole mounted installation is shown in Fig. 20. Fig. 24 is a 
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close-up of the bottom of the rectifier chassis which shows the regulating 
network mounted on a printed wiring board. 


6.2 Air Cell Primary Battery Plant 


Because ac will not be readily available at all remote locations, an 
alternate power supply has been developed and this is shown in Fig. 25. 


Pi ADJ VOLTS 


Fig. 24 — Close-up of bottom of rectifier chassis. 
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The alternative supply uses oxygen-depolarized primary cells having 
an alkaline electrolyte, and has been used for many years in railway 
signaling circuits and in the telephone plant. Sixteen battery cells are 
connected in series to provide enough power for three years of operation 
of a remote terminal or about one year for a remote repeater. The battery 
is discarded when fully discharged and is then replaced by a new battery. 
7. APPLICATION OF Pl CARRIER TO RURAL TELEPHONE LINES 


The PI carrier system is to be applied to normal exchange loop plant 
facilities engineered in accordance with the present Resistance Design 


Fig. 25 — Primary battery power plant. 
£ y 
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Methods used generally throughout the Bell System.‘ These facilities 
consist of mixed gauges of high capacitance cable extended at their outer 
ends by 109-mil steel, 104 mil-copper or copper steel open wire. 

In engineering the carrier line design or carrier layout, the Plant Engi- 
neer will determine the carrier line layout necessary to meet the over-all 
requirements for a suitable carrier transmission path on the available 
physical facilities. To do so he need only be familiar with the general 
‘apabilities of the carrier system, its basic “building blocks,” and the 
limitations that must be considered in applying the system to the physi- 
cal line. The capabilities of the carrier system have been described in 
earlier sections. From those descriptions it can be seen that the basic 
“building blocks” for a Pl carrier system are: 

1. Central office channel terminals 

2. Remote channel terminals 

3. Repeaters 

4. Ac or de remote terminal and repeater power supplies 
5. Carrier line networks and filters 

A carrier application of these “building blocks” is shown in schematic 
form in Fig. 26. 

The low-pass filters or carrier blocking networks shown are placed at 
the junctions of the carrier line and side leads of customer drops served 
by physical or derived voice frequency circuits on the base carrier facil- 
ity. These filters are required to reduce the bridging loss of the side leads 
at carrier frequencies and to keep carrier frequencies out of the customer 
drops to prevent annoyance to the customers. High-pass filters are pro- 
vided to make the carrier line continuous at carrier frequencies, but 
divide it into isolated sections for voice frequency distribution. 

In addition to these blocks, an autotransformer may be required at 
the junction of the open wire and cable. The autotransformer, either 
alone or in conjunction with a junction line filter, is required to eliminate 
reflection losses and reduce crosstalk at carrier frequencies due to im- 
pedance mismatch between the cable and open wire. The junction line 
filter is required to allow the carrier and physical voice frequency circuit 
to be used on different pairs in the cable and on the same open-wire pair 
beyond the cable-open-wire junction. This is necessary where the physi- 
‘al circuit is so long that load coils are required on the voice frequency 
sable pair and non-loaded cable pairs are required for carrier. A pair 
of junction line filters may also be used to provide a voice frequency 
by- “pass é around a repeater. As illustrated in Fig. 26, this may be necessary 


4L. B. Bogan and K. D. Young, Simplified Transmission Engineering in E 43 
change Cable Plant Design, A.I..£. Communication and Electronies, No. 15 
page 498, Nov. 1954. 
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to serve customers beyond that point from physical voice frequency 
circuit. 

A carrier line termination network is also provided to terminate the 
end of the carrier line at all frequencies and thus prevent reflections from 
interfering with the transmission at remote carrier terminals spaced along 
the line. This network and all of the other line networks are available in 
the pole or crossarm mounted arrangement shown in Figure 14 and de- 
scribed in Section 4.5. 

Fig. 26 also gives examples of two types of subscriber distribution 
beyond the remote carrier terminals. One, wire distribution, is indicated 
by the voice frequency extensions of Channels 1 and 4 and the other, 
filter distribution, is shown for Channels 2 and 3. Filter distribution 
permits the carrier line to be used simultaneously for carrier transmission 
and voice frequency distribution of the derived voice frequency circuit, 
thus saving the pair of wires required if wire distribution were used. 


7.1 Layout Procedure and Ground Rules 


P1 carrier channel layouts for a given rural line will be based on the 
forecast of commercial requirements for that route. The Plant Engineer 
must determine the number and arrangements of channels which can 
be applied within the system limits to meet that forecast. The locations 
of remote terminals are then chosen based on customer locations, channel 
frequency arrangements, and the availability of commercial ac power. 
With the terminal locations fixed, the line losses are determined at ap- 
propriate frequencies and repeaters are specified as necessary along with 
any line networks and filters required for the layout. 

The characteristics and limitations of the P1 system lead to certain 
simplified ground rules which may be used in laying out the carrier chan- 
nels. Some of these rules are summarized in Fig. 27. The stackable fre- 
quency arrangement is used for non-repeatered operation, and the design 
of the carrier channels permits the bare line loss of each individual chan- 
nel to be 30 db at the top frequency between the central office terminal 
and the remote terminal. 

Another limit shown in the figure is that the de loop resistance of the 
voice frequency extension beyond the remote terminal can not exceed 390 
ohms (5 miles of 109-steel wire). The 390-ohm limit is determined by the 
talking battery supply requirements of the 500-type customer telephone 
sets when the battery is supplied from the remote P1 terminal power 
supply. The PI carrier system has been designed to operate with the 
500-type telephone set. The improved dialing, ringing and transmission 
features of that set will help to insure satisfactory performance of the 
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over-all carrier derived circuit. In keeping with the system objectives, 
the over-all transmission of a carrier channel and its voice frequency 
extension, using the 500-type telephone set, will be as good or better 
than that obtained on long rural lines using physical plant laid out by 
the Resistance Design Method mentioned earlier. 

As shown in Fig. 27, the normal and staggered grouped frequency 
arrangements used for repeater operation allow 30-db bare line attenua- 
tion at the top frequency (96 kc) between the central office terminal and 
the first repeater or between repeaters, and about 30 db between the 
last repeater and each remote channel terminal at the top frequency used 
for that channel. Directional filter characteristics limit the repeater sys- 
tem can use a maximum of four repeaters for a total line loss of about 150 
db at 96 ke. However, noise and crosstalk requirements will permit no 
more than two of the four repeaters to be used in the open-wire line, with 
the last cable repeater at least one mile back in the cable from the cable- 
open-wire junction, as shown in Fig. 27. Spacings must be limited to 
somewhat less than 30 db on certain line facilities such as B rural wire 
to insure proper terminal regulation.° 
CENTRAL 


OFFICE __ CABLE AND _______y CABLE OR 
cuneonie OPEN WIRE | OPEN WIRE 


TERMINAL 
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(b) GROUPED FREQUENCY ARRANGEMENTS: REPEATED 


Fig. 27 — P1 carrier application ground rules. 


§C.C. Lawson, Rural Distribution Wire, Bell Lab. Record, pp. 167-170, May, 
1954. 
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In addition to bare line loss, the ground rules make allowance for ap- 
proximately 3 db of miscellaneous losses in any normal channel layout, 
including bridging losses of carrier blocking networks and other terminals 
on the carrier line, insertion losses of high-pass filters, and losses in the 
autotransformer and junction line filters used at the cable-open-wire 
junction. Since these losses do not all add directly, it is simpler to use an 
average loss factor to cover most conditions rather than make compu- 
tations to determine a definite loss for each set of conditions that might 
exist. Thus for channels using a stackable frequency arrangement, a 
maximum of about 33 db loss, including the bare line loss and miscel- 
laneous losses, may be expected between the points where the terminals 
connect to the line. 

A further loss is experienced because the remote terminals are bridged 
onto the carrier line. As a result the carrier power transmitted toward 
the central office terminal is only +0.5 dbm due to a bridging loss of 
about 3.5 db at that point. Therefore, in the remote-to-central office 
direction, the minimum power will be —32.5 dbm (0.5 dbm — 33 dbm) 
at the line terminals of the central office terminal. The minimum carrier 
power in the central office to remote direction will be —20 dbm (+4 
dbm — 33 dbm) at the bridging point of the remote terminal. 


7.2 Terminal and Repeater Location 


In laying out the carrier line design, it is first necessary to determine 
the possible locations for the remote terminals based on distances to 
the customers to be served and the availability of commercial ac power, 
since this is the most economical power source. (When commercial ac 
power can not readily be made available, the primary air cell batteries 
can be used.) Having determined the ideal location of the terminals from 
a physical standpoint, the makeup of the physical circuits back to the 
central office must be determined and computations made of the carrier 
frequency attenuation of the facilities. These loss computations are used 
to determine the number of repeaters required, if any, and their locations, 
once again modified by availability of commercial power. The Plant 
Engineer must also check for the necessity of input and output pads at 
the terminals and repeaters. 

The need for loss computations led to the development of length-loss 
charts so that a carrier line design could be made in a manner very similar 
to the loop cable design using the Resistance Design Methods as men- 
tioned earlier. Fig. 28 shows one of the 96-ke length-loss charts used 
to lay out repeater spacings and Channel 4 over-all circuit design. Fig. 29 
shows the 48-ke length-loss charts as an example of the charts that are 
provided at each carrier frequency other than 96 ke for terminal-to- 
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terminal section layouts of all channels using the stackable frequency 
arrangement or repeater-to-terminal section layouts for channels using 
grouped normal or staggered frequency arrangements. 


7.3 Pad Selection 


The Plant Engineer is given general ground rules for determining the 
values of input and output pads used in the terminals and repeaters. 
Charts are provided for use in determining the input and output pads, 
and they are so arranged that the engineer can take values directly from 
the length-loss charts and enter them into the appropriate slots to cal- 
culate the proper pad values. 


7.4 Crosstalk Limitations 


The Plant Engineer must be given information showing how many 
carrier channels can be applied to each circuit of open wire, cable or B- 
rural wire on a rural route. Crosstalk studies and tests have indicated 
that the stackable frequency arrangement or the grouped frequency 
arrangements used singly or in combination can be used on cable or B 
rural wire with a full system complement of channels applied to each 
pair. However, in the case of open wire, the frequency arrangement and 
number of channels which can be applied is very dependent on the type 


of transposition system used. The R1 design is the most commonly used 
transposition design on rural lines of the Bell System, and Fig. 30 gives 


POLE PAIR 
NONE 
CHANNEL NOS, ! 


(a) STACKABLE: 1,2,3,4 
(b) GROUPED: 


(a) STACKABLE: 
GROUPED: 


FREQUENCY ARRANGEMENT: 
N = NORMAL GROUPED 
S = STAGGERED GROUPED 


Fig. 30 — Number of P1 carrier channels on an R1 transposed line. 
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the carrier assignments for the various frequency arrangements on a 
one and two-crossarm route using the R1 transposition design. Use of a 
transposition system giving better carrier frequency crosstalk perform- 
ance than the R1 design is expected to permit the application of a number 
of additional channels over those shown in Fig. 2. 

It will be noted that the grouped arrangement provides four channels 
less on a two-crossarm basis than the stackable arrangement. From a 
transmission standpoint, equal numbers of channels would be possible 
by assigning one or two channels to a pair, but these arrangements will 
usually be uneconomical for grouped systems because of the high cost 
per channel of repeaters. 


7.5 Line Networks 


The location of the line networks and filters, which are a permanent 
part of the carrier layout, will be designated by the Plant Engineer, and 
the location and type of the remaining networks, which will vary with 
changes in subscriber service, will be selected by the plant forces. The 
low-pass filters or carrier blocking networks used on the carrier lines are 
simple resonant circuits designed to match given ranges of capacitance 
that will be presented by the drop wire or open-wire side leads. Since 
this capacitance varies considerably with various lengths of facility, a 
method will be provided by which the total capacitance of the drop can 
be determined and the proper network chosen. The other line networks 
are applied to the line as necessary to achieve their particular functions. 


8. INSTALLATION AND MAINTENANCE 


A portable field test set has been developed which will simplify the 
installation and maintenance of the Pl carrier system. The new 
set, known as the 7F test set, will provide the carrier and audio frequen- 
cies and a means of measuring them required to align and troubleshoot 
units of the system. The set, which is battery operated, contains a carrier 
oscillator to supply test frequencies from 10 to 100 ke, an audio fre- 
quency oscillator having six selected frequencies in the range of 250 to 
2,500 eycles, a modulator to modulate the carrier frequency signal with 
the audio signal, a demodulator for calibrating the modulated signals, 
and a wide-band amplifier-detector for making level and transmission 
measurements. The model of the set shown in Fig 31 included a pre- 
cision dial for signaling testing which was subsequently found unneces- 
sary and eliminated. An ac operated set providing the same desired fa- 
cilities is now under development. 


Be 
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The carrier channel installation and lineup procedure is set up on the 
basis of using the test set and a generally available volt-ohmmeter to 
make a series of measurements in a specified order. This will permit 
potentiometers to be adjusted as necessary until the specified meter 
readings are obtained at built-in test points. Lineup of the terminals and 
repeaters done first in the central office to insure proper operation and 
then at the in-plant locations to check system performance. 

Maintenance will be handled on a complete terminal replacement 
basis and will consist of making a series of checks with the test set to 


determine whether the terminal is functioning satisfactorily. If it is not 


and it cannot be adjusted to restore satisfactory operation, a replace- 
ment terminal will be used to restore service. All repairs and isolation of 
trouble within the terminal unit or on the individual boards will be han- 
dled at a centralized testing or repair point so as to require a minimum 
of personnel with electronic experience. The test set has been designed 
to handle all tests for a Pl carrier system, when used with the volt-ohm- 
meter, and when used by trained personnel will permit trouble to be 
isolated to a given printed wiring board in the PI carrier equipment. 
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Fig. 31 — P1 carrier 7F test set. 
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An Experimental Dual Polarization 
Antenna Feed for Three Radio 


Relay Bands 


By R. W. DAWSON 


(Manuscript received April 26, 1956) 


The fundamental problems associated with coupled-wave transducers 
which operate over a 3-to-1 frequency band have been explored and usabli 
solutions found. The experimental models described are directed toward the 
broad objectives of feeding the horn-reflector antenna with two polarizations 
of waves in the 4-, 6- and 11-kme radio relay bands. 


INTRODUCTION 


There are at least two communications problems which require fre- 
quency selective filters that operate in waveguides over an approxi- 
mately 35-to-1 frequency interval: (1) channel-separation filters for a 
circular-electric waveguide system in which it is desirable to use the 
medium from perhaps 35 to 75 kme,! and (2) band-separation networks 
needed for the horn reflector antenna that permits simultaneous trans- 
mission or reception in the 4, 6 and 11 kme bands with both polariza- 
tions.?:* The research reported in this paper was directed at deter- 
mining the capabilities of coupled-wave transducers for solving such 
problems. Experimental work was directed toward the second problem 
(above) because it is more immediate. 

Fig. 1, which is a schematic representation of the feed array, comprised 
of three sets of directional couplers, shows that the 4-kme bands are 


mB mF OG Cm 


E a K 


Fig. 1 —- Schematic of dual-polarization feed for three microwave bands. 
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separated one at a time at the antenna end of the array. The 6-kme bands 
are separated next and the 11-kme bands are added or removed at the 
far end of the array. 


GENERAL OBJECTIVES 


Negligible loss of power should result when coupling Th,” waves 
to TEK,,~ waves for the six bands concerned.* The 6- and 11-kme waves of 
both polarizations must pass through the round guide of the 4-kme 
transducers without significant attenuation. Waves in the 11-kme band 
must also pass through the circular guide of the 6-kme transducers 
without appreciable loss. A good impedance match is desired at all 
ports. No cross coupling is desired between the orthogonally polarized 
waves in the round guide. 


FUNDAMENTAL PROBLEMS ENCOUNTERED 


The frequency-selectivity required to separate various bands in the 
same polarization can be achieved in a coupled-wave device by either 


varying the coupling coefficient and, or varying the phase constant, as 
illustrated by the expression for the amplitude of the selected wave:! 


== ! sin 4/1 (1) 
1+ (2: \ 2e 


2c 
where = coupling coefficient 
= length of coupling array 
; 8 = phase constant 


In the present designs some of the frequency selectivity is in the coupling 
holes. The greater part of the selectivity is in the design of the phase 
constants; they are made equal in the band to be selected (8; — B. = 0) 
and very unequal 


— Bs > 2c) 


2c 
in the frequency bands to be passed. 
The size of the coupling hole must be controlled to avoid coupling hole 
resonance in any of the three bands that may be present. This problem 
is especially bothersome in the 4 kme coupler where signals are present 


* As used in this article, superscripts O and © refer to round and rectang- 
ular waveguides, respectively. 
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in all three bands. To keep the coupler length within a reasonable size, 
the individual hole dimensions must be on the order of \o/4 (at 4 kme), 
which will permit coupling hole resonance within the 3-to-1 frequency 
band. A further consideration is the selection of the hole shape to avoid 
perturbing the Thi? wave that is orthogonal to the strongly coupled 
wave. 

Spacing of the coupling holes must nol be \,/2 to avoid: (1) large re- 
flections in the driven waveguide, and (2) large backward-travelling 
waves in the adjacent coupled waveguide. This requirement is easily 
met in the 11-kme coupler where only one band is present; however, the 
presence of signals in two or three bands makes the non \,/2 spacing 
more difficult in the 6- and 4-kme couplers. 

Another phenomena of importance exists in coupled waveguides operat- 
ing over an extended frequency range. A coupling aperture in the side 
wall (see Fig. 1) may interact with a high-order mode at the latter’s 
cutoff frequency, resulting in a significant perturbation of the desired 
coupling. For example, at the frequency where TEx° passes through cut 
off, the coupling between TEy,° and TE," will be perturbed if the coup- 
ling hole is of sufficient size. Small coupling holes do not allow this pertur- 
bation to manifest itself. Coupling holes in a realistic design do become 
large enough to allow this effect to appear. Since dominant mode guides 
in the 4- and 6-kme bands can support other modes in the higher fre- 
quency bands, considerable caution must be exercised in selecting the 
round guide sizes on this account alone. (The size of the round guide is 
determined also by the phase velocity in the rectangular guide). 
SELECTION OF 


COUPLING APERTURES 


A series of holes in either the narrow or broad side of the rectangular 
guide can, in principle, be used to achieve complete power transfer from 
TE." waves to TEn° waves. The specific consequences of coupling 
through holes located along the center line of the broad side will be 
considered first. (Off center holes are not of interest because they couple 
TE,0° waves to both polarizations of TEn° waves in a frequency-sensitive 
way.) The transverse magnetic field 7, and the electric field /, of the 
TE? waves can couple to TE\0° waves. When two fields couple, the back- 
ward wave in the undriven guide can be greater than the forward wave 
in the same guide. To avoid this possibility, transverse slots can be used 
to prevent electric field coupling. The coupling of a transverse slot in- 
creases as the frequency is increased which suggests that 11 kme signals 
be introduced at the position nearest to the antenna because the largest 
tolerable apertures for an 11-kme coupler would not perturb 6- or 4-kme 


< 
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waves. Unfortunately, coupling to slots in this orientation is small, re- 
quiring several hundred for complete power transfer. Such a large num- 
ber would make the coupler too long. 

Coupling through holes in the center of the narrow wall of the rec- 
tangular waveguide as shown in Fig. 2 allows only the longitudinal mag- 
netic field Hz to couple when the electric field of the TE,° wave is 
parallel to the hole containing wall. No coupling exists between the 
TE." waves and the TEn° wave having an electric field perpendicular 
to the plane of the hole. The use of longitudinal slots where practicable 
minimizes perturbation of this wave. Since the desired TEyy” — TE,° 
coupling decreases by 15 db from the 4 kme to the 11 kme bands (for 
1.872 X 0.872” and 2.2” diam. guide), the layout of Fig. 1 suggests itself 
since some coupling discrimination is present for the higher frequency 
waves that pass through the lower frequency couplers. 


DESIGN OF 11-KMC COUPLER 


The objective of this design is to transfer all of the power from a 
dominant mode rectangular guide into one polarization of the TE,° 
forward traveling wave in an adjacent circular guide. Fundamental 
coupled-wave theory’ shows that phase velocities must be matched in the 
two guides to achieve complete power transfer. 

A standard rectangular guide size is selected and the round guide 
size that has the same phase constant is calculated for the center of the 
1,000-me wide band. An approximate total length is selected for the 
series of coupling holes that permits the holes to be spaced approxi- 
mately A,/4 apart. The hole spacing is not critical although the non- 
directional properties of \,/2 spacing must be avoided. The required 
magnitude of multiple discrete couplings is shown in equation (40) of 


Reference 4 to be: 
a = sin {| 
n 


where n is the number of coupling holes and @ is the amplitude of the 
wave transferred at a single coupling hole for unit incident amplitude. 

Equation (3) expresses the power coupled from TEn®° waves to TE107 
waves through a circular hole in a common wall of zero thickness where 
P, is the power propagating away from the coupling point in either 
direction in the undriven guide, and P, is in the driven guide. This deriva- 
tion is based on the work of H. A. Bethe’ and some unpublished notes of 
S. P. Morgan. 


4 
4 
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do ho ¥ (3) 


The quantities a and b are the large and small dimensions of the ree- 
tangular waveguide and 7? is the round guide radius. The wavelength in 
air is designated by \») , and the radius of the coupling hole by r. A cor- 
rection for the finite thickness of the wall is made by considering the 
circular coupling hole to be a round waveguide beyond cutoff.” The ad- 


ditional loss is 
6 
= = (decibels) (4) 
r 


where ¢ is the wall thickness. Total coupling loss per hole is defined by 


20 logioa = 10 log —aA (5) 

The number of coupling holes n is found from the approximate coupling 

ping 

length and hole spacing. Equation (2) is used to find a and then the hole 
radius r is calculated from (3). 

Waveguide dimensions must be corrected to allow for the perturba- 
tion of the phase constants due to the coupling holes. The perturbed 
phase constant* for the round guide is 

o , Vp? 

d 
where d is the hole spacing and p° is the coupling between a pair of round 
guides 


Xo? 


= \ 2 
8 Ao (7) 


The perturbed phase constant for the rectangular guide is 


* This correction is due toS. A. Schelkunoff as noted on page 708 in Reference 4. 
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DIA. = 0.337" 


N=28 EQUIVALENT UNIFORM HOLES 


10.375" 


Fig. 2 — 11-kme coupler sketch. 
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TRANSFER LOSS IN DECIBELS 
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9.8 10,2 10.6 11.0 11.4 
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Fig. 3 — 11-kme coupler transfer loss. 
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The perturbed phase constants are made equal through a suitable 
choice of R and a, this choice being somewhat influenced by the ecoupling- 
hole radius r. 

Three coupling apertures of successively reduced size are used at the 
ends of the array of identical holes to produce four reflections having the 
relative amplitudes of 1, 2.7, 2.7, 1. The modified binomial distribution 
was chosen because impedance matching can be secured over a broader 


INSERTION LOSS IN DECIBELS 


THIS POLARIZATION 
COUPLES 


9.6 10 10.4 10.8 1,2 11.6 
FREQUENCY IN KILOMEGACYCLES PER SECOND 


Fig. 4 — 11-kme coupler insertion loss. 


band (with minor degradation of the center-frequency match) when 
compared to a standard binomial distribution. Amplitude reflections* 
from the start of the coupling array are 


_ Al 


A, = — 
where Q is the reflection from a single coupling hole. 
Fig. 2 is a sketch of the coupler with the final design dimensions. 
Fig. 3 shows the measured and theoretical transfer loss of an 11 kme 
coupler. Fig. 4 indicates the measured insertion loss for the same coupler. 


DESIGN OF 6-KMC COUPLER 


The 6-kme coupler as shown in Fig. 5 utilizes a partially dielectric- 
filled rectangular guide coupled to the circular guide. The use of dielectric 
loading makes it possible for the phase velocities to be equal in the two 
guides in the center of the 500-me wide 6-kme band, and unequal in 


* Information given to 8. kk. Miller by 8. A. Schelkunoff in an informal com- 
munication. 
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N=30 EQUIVALENT UNIFORM HOLES 
FOR ONE ROW 


HOLE SIZE =0.030" x 0.627" 
SPACING = 0.660" 


1.310" 
| 


Fig. 5 — 6-kme coupler sketch. 
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Fig. 6 — Transfer loss of 6-kme coupler. 
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the 11-kme band; thereby low transfer loss is obtained in the 6-kme band 
and a high transfer loss in the 11-kme band. Measurements have shown 
that when the cut-off frequencies of higher modes occur in the band of 
interest an uncontrolled increase of coupling may result. Special precau- 


tions are required in selecting the round guide size to avoid this condi- 
tion. The design process is shown in Appendix I. Figs. 6 and 7 show the 
transfer and insertion losses in the 6-kme band. 


16 


ROUND GUIDE 
INSERTION LOSS 
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INSERTION LOSS IN DECIBELS 


6.2 6.4 6.6 6.8 
FREQUENCY IN KILOMEGACYLES 
Fig. 7 — Insertion loss of 6-kme coupler. 


SLOT SIZE=0.740” x 0.0786” 
SPACING = 0.780” 

POST DIA= 0.300” 

GUIDE SIZE=1.724”" x 0.872” 


COUPLING ARRAY CONSISTS 
OF 38 FULL SIZE SLOTS & 
6 REDUCED SIZE SLOTS 


Fig. 8 — 4-kme coupler sketch. 
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Transfer loss of 4-kme coupler in 4-, 6- and 11-kme bands. 
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Fig. 10 — Transfer loss of 4-kme coupler in 4-kme band. 


4-KMC DESIGN 


The use of periodic loading in the rectangular guide is not suitable for 
use in a 4-kme coupler design. When the phase constants are made equal 
in the 4-kme band, the resulting difference of phase constants in the 
6-kme band is too small to create a sufficiently high transfer loss in that 
band. Periodic loading can produce the desired result. 
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Capacitive rods form a periodic structure in the rectangular guide, as 
shown in Fig. 8, that creates a rejection band in the 6-kme region. Fig. 9 
illustrates how effectively the rejection band increases the transfer loss 
in the 6-kme region. Phase velocities are made equal in the rectangular 
and round guides at the center of the 500-me wide 4-kme band to secure 
a low transfer loss. Due to the rejection bands and the difference of phase 
constants, high transfer losses result in the 6- and I1-kme bands. To 
prevent uncontrolled coupling the round guide size is chosen so that no 
modes cut off in the three bands. Details of the design are covered in 
Appendix IT. Figs. 10 and 11 show the measured transfer and insertion 
losses in the 4-kme band. 


MEASURED CHARACTERISTICS OF ARRAY 


The three pairs of couplers were assembled in a tandem array with 
linear taper sections between them. In the discussions that follow the port 
designations of Fig. | will be used. Transfer loss measurements indicate 
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Insertion loss of 4-kme coupler in 4-kme band. 
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how much power in a forward traveling TE,° wave is transferred to a 
forward traveling TE;o” wave. Figure 12 shows that the coupling polari- 
zation transfer loss remains under 1.1 db in the three bands except for a 
small region in the 11-kme band, while the transfer loss for the non- 
coupling polarization exceeds 20 db in the three regions as shown in 
Fig. 13. The return loss at Port P exceeded 23 db over the 4-kme band. 
This result included the total reflection of 4-kme signals from Taper 
J-K after attenuation by twice the coupler insertion loss, and also in- 
cluded the reflections from the rectangular guide port N or L which 
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Fig. 12 — Transfer losses in the array for coupling polarization. 


are separated from P by only the small transfer loss. Return loss for the 
6- and 11-kme bands exceeded 23 db at Port P. Cross polarization is the 
ratio of the energy in the coupling polarization waves to the orthogonal 
non-coupling polarization waves emerging at Port P. Cross polarization 
figures are no lower than 20, 32 and 22 db in the 4,-6- and 11-kme bands. 


CONSTRUCTION OF COUPLERS 


The coupler design requires that the coupling aperture exist in a 
narrow wall of the rectangular guide that is common to the round 
guide. The 4-kmec coupler consists of machined rectangular and round 
sections. A two-piece rectangular guide was milled from brass and 
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Fig. 13 — Transfer losses in the array for non-coupling polarization. 


soldered to the round guide. The coupling slots were then cut through 
the common wall. An electroforming technique was used to fabricate 
the 6- and 11-kme couplers. A rectangular guide with precut coupling 
holes was clamped to a round mandrel and the entire structure was 
electroformed. The mandrel was later removed by dissolving it in a hot 
concentrated solution of sodium hydroxide. A typical mandrel and 
rectangular guide is shown in Fig. 14. An illustration of the entire 
ensemble appears as Fig. 15. 


DESIGN REFINEMENTS 


Return loss at the round Port P might be improved in a revised 
design by broad-banding the TE,o°-TE,° transfer loss, with an asso- 
ciated increase in the TE,,° insertion loss. This might be done without 
introducing mode troubles, by using ridged waveguide. In Fig. 12 the 
abrupt peak of transfer loss for coupling polarization waves in the 11 


Fig. 14 — Mandrel and rectangular guide. 
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kme band is due to an appreciable coupling of TE,° waves to TE30° 
waves in the 4 kme couplers in the vicinity of the TE cutoff. The TE 
cutoff might be moved above the 11-kme band by using ridged rectangu- 
lar guide. 


SUMMARY 


A combination of three pairs of coupled-wave transducers has sue- 
cessfully permitted six distinct. bands to be fed from individual TE;¢ 
rectangular waveguides into the two orthogonal polarizations of TE), 
waves in a multi-mode round waveguide. The resulting structure en- 
abled low transfer loss values to be obtained simultaneously over two 
500-me wide bands and one band 1000-me wide distributed over a 3-to-1 
frequency interval. 
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APPENDIX | 
DESIGN OF 6-KMC COUPLER 


A rectangular guide size is chosen and a reasonable value selected for 
the phase constant such as B~ = 1.59/X). The resulting round guide 
size must prevent modes from cutting-off within the 6-kme band and 
preferably the cut-off frequency should be above the 6-kme band. The 
thickness of the dielectric (polystyrene) strip is determined’ from (11), 
(12), and (13) where K, and KA, are transverse wave numbers of the air- 
filled and of the dielectric sections of the guide. 


= — — 2 ( 
V (=) Bp 11) 


2r\" 
(7) Bo 12) 


Cot Kod = — = cot Ki(a — d) (13) 


In the above equations ¢, is the relative dielectric constant and d is the 
slab thickness. After solving for d the equations are resolved for Ay and 
Ke values at the 6-kme band edges and also at the lower end of the 11-kme 
band. Resulting rectangular-guide phase constants should cause a very 
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low and a high value of transfer loss as indicated by (14) which repre- 

sents the forward traveling coupled wave amplitude where ex is the total 
4 

coupling strength. 


= =? sin / 


At midband ex = 2/2 and experience has shown « & 10 Xo ; therefore 
¢ & 7/20) at midband. The coupling hole radius is found from (3). 
Because longitudinal slots are used instead of round holes the equivalent 
hole radius r is found from $r° = P¢’, where P is the magnetic polariza- 
bility’ and ¢ is the length of the chosen slot. To avoid slot resonance in 
either band, the length was chosen to be approximately \»/4 in the 
6-kme band and 2d) in the 11-kme band. The power expression of (3) 
must be corrected by the wall thickness effect (4) and also multiplied 
by the factor F due to the presence of the dielectrie slab’ (15). 


— sin 20,) + — 3 sin 26)| 
cos 


cos 


(15) 


d) Ay = hod A = ( 


Theoretical coupling loss per hole is defined by 


20 logyo log — (A + 10 logio F) (16) 
1 


An additional correction which reduces the coupling loss is due to the 
long length of the slot. Although the slot resonates near 9 kme, an in- 
crease of 3 db in a single slot coupling results at 6 kme. This effect was 
found experimentally from a sample test line with several slots. To avoid 
excessive length two rows of coupling slots were employed. They were 
staggered to improve the continuity of coupling from discrete points. 
An approximate design is on hand at this point. The final dimensions 
for the guides and coupling holes are found after the perturbations of the 
phase constants are considered by the same process as noted in the dis- 
cussion of the 11-kme design. Impedance matching for the dielectric 
strip and the coupling slot array was patterned after the technique 
shown for the coupling hole array in the 11 kme design. 
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APPENDIX IT 
DESIGN OF 4-KMC COUPLER 


A round guide size is selected so that no modes are cut-off in the three 
bands. Coupling power varies with wavelength as shown in (17) for 
TE» to coupling. 


[1 - | 


To maintain the minimum variation across the band, the following re- 

quirements must be met which were deduced from the coupled wave 
4 ~ 

theory and (17). 


sin (ex); = sin (cv) (18) 


3.413R 
(ex), + (ex)o = (20) 


The equations are solved for sin cx (the minimum band edge transfer 
loss) which for a diameter of 2.10” is 0.5 db. A value of 30 db is chosen 
for a, (2), as the first approximation. Because the band edge transfer 
loss is 0.5 db due to frequency variation of coupling, an additional loss 
of only 0.2 db is allowed for the phase constant difference (8,7 — B°). 
It is now necessary to make B° = 8." , where 8," is the phase constant 
of the loaded rectangular guide. Equation (21) gives a theoretical periodic 
loading formula where L is the spacing between loading elements.” 


cos = A cos (8°L + 6) 


® = art tan = Wit 


Assume initially that L = 2/8°. The required susceptance bo of the ca- 
pacitive rods can be found experimentally from a loaded test line by 
varying bo until the first rejection band covers the 6 kme band. An 
iterated process is used to find L because it is dependent on 8° which is 
the parameter being sought. Measurements indicated that (21) does not 
predict B° very accurately and for that reason an experimental adjust- 
ment of (8,7 — B°) is desirable. 

The guide dimensions are now known; however, they must be cor- 
rected for the perturbations of the phase velocities as outlined in the 11- 


(21) 
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kme coupler design. A single row of longitudinal coupling slots which 
are not resonant in the 6- or 11-kme bands is used. The radius of a round 
hole equivalent to the slot is obtained as in Appendix I, by setting 
= Pe’ 

Impedance matching of the coupling array and of the loading elements 
is accomplished by tapering the amplitude of the end elements as indi- 
cated by a discrimination function of the coupled wave theory.’ For a 


5 element series 1 —n; — I 
2(n 1) + Ne 


SrZ 
2 (n COS = + ¢os = + ny 
Ag. 


D = 


where Z is the spacing between elements. 

The discrimination D is set equal to infinity which permits the de- 
nominator to be set equal to zero and solved simultaneously for n; and 
ne by using both band edge wave-lengths. Round hole sizes are readily 
obtained since the coupling coefficient is directly proportional to the 
cube of the hole radius from which the necessary equivalent longitudinal 
slot can be calculated. Susceptance values of the capacitive posts are 
found from the absolute value of the reflection coefficient which equals 


bo 
+ 4 
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The Character of Waveguide Modes in 


Gyromagnetic Media 


By H. SEIDEL 


(Manuseript received August 31, 1956) 


A magnetized gyromagnetic medium is birefringent. The effect of bire- 
fringence is studied in rectangular and circular waveguides with special 
attention paid to propagation characteristics in guides of arbitrarily small 
cross-section. Propagating, small-size structures are found in certain ranges 
of magnetization for both types of guide. 


I, INTRODUCTION 


A gyromagnetic medium, isotropic in the absence of a magnetizing 
field, becomes axially symmetric with respect to that field when mag- 
netized. A tensor susceptibility! is thus produced which reflects the 
resulting anisotropy. Two essentially different types of rays appear in 
the medium in much the same manner in which the ordinary and extra- 
ordinary optical rays form in a caleite erystal. These rays may combine 
to produce results in a ferrite loaded waveguide quite alien in character 
to those of a conventional isotropic guide. Since the ferrite is, to first 
order, characteristic of general gyromagnetic media we shall discuss all 
gyromagnetic phenomena in terms of ferrites alone. 

One very startling phenomenon observed in ferrite loaded waveguides 
is the occurrence of propagation in a waveguide of arbitrarily small 
transverse dimensions.? We shall show that this type of wave guide 
behavior is a consequence of the particular form of the birefringent 
character of the medium. 

In order to understand the nature of the ferrite loaded case let us first 
consider the conventional isotropic small wave guide. Fig. 1 shows, 
schematically, the field distribution encountered in a small rectangular 
waveguide operating in a (1,1) mode. The x axis is shown along the 
wide transverse dimension and z is along the narrow height dimension. 
The y axis is chosen to coincide with the guide axis. 

The field solutions of such a waveguide may be obtained as a super- 
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Fig. 1 — Rectangular waveguide mode in isotropic medium for cutoff guide. 


position of plane waves of dependence ¢~#*'®. If we represent Rk in a 
‘artesian frame, the wave equation is satisfied for the condition 


+h? = 


where » and ¢ are the permeability and permittivity respectively of 
the medium. Satisfaction of wall boundary condition requires that /, 
and k, be real and that each be of the order of the reciprocal of the 
transverse guide dimensions. Small transverse dimensions thus cause i, 
to be negative, driving the waveguide into a cutoff condition. 

We shall now find that birefringence permits another class of modes 
in the small size ferrite loaded waveguide. Letting the magnetic axis be in 
the z direction, it will be shown in the text that corresponding to any 
mode of the guide k, and k. are unique. Birefringence generally requires 
that two different magnitudes of Rk occur simultaneously, causing two 
different values of k, to appear. In particular, let us postulate that both 
these values of /, are imaginary. Given two exponentials, it is possible 
now to satisfy the requirements of electric field nulls at either side wall, 
as shown in Fig. 2. At the other side wall we shall show that the ex- 
ponentials decay so fast as to effectively cause the field to vanish there. 
Since ki,» are now negative quantities, there is no contradiction in pre- 
suming that /,’ may now be positive, thus permitting propagation in an 
arbitrarily small size waveguide. 

The effect of birefringence may then be that of transforming a class 
of longitudinally cutoff modes into another class that propagates longitu- 
dinally but cuts off transversely. The condition of this occurrence will 
be shown to be that for which the diagonal term of the Polder tensor, yz, 
is positive and is lessin magnitude than the magnitude of the off diagonal 
term x. In the case of a small rectangular guide, propagation occurs 
anomalously for negative values of yw, as well but in a manner not as 


Z VARIATION 
x 
X VARIATION 
Kite 
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substantially dependent on the birefringent character of the medium for 
large width to height aspect ratios of the waveguide. We shall find, 
further, that propagation occurs with entirely real values of k, and k, . 

It will be shown that the proper wave equation for one of the two 
birefringent rays is satisfied in the small waveguide limit by the rela- 
tionship 


ko + ky + = 0. 


In the region of » > 0, and /, real, we confirm somewhat more rigorously 
the requirement stated earlier that either k, or /, be imaginary. However, 
k, and k, may both be real over a range of negative values of u, permitting 
boundary conditions to be satisfied, approximately, in waveguides 
having aspect ratios of the type discussed earlier, by Just one class of 
rays in the small size waveguide. 

Propagation in small size circular guide employing the essential charac- 
ter of birefringence, occurs over the entire range of |u| < > «.. This 
range is divided into that of » > O and that of uw < 0. Transmission 
occurs in one sense of circular polarization in each of these regions and 


for both senses for «4 < 0. Thompson” has suggested that propagation in 


a small circular waveguide might be attributed to the negative permea- 
bility of one preferred polarization; it appears, however, that propaga- 
tion is possible over a considerably wider range of conditions and 
for somewhat different reasons. 

In the case shown in Fig. 2, higher propagating modes occur in a 
rectangular waveguide when one half or more sinusoids of field varia- 
tion occurs in the z direction. These simply produce the result of stronger 


RESULTANT —- 
X VARIATION 


Fig. 2 — Mode in ferrite filled rectangular guide. 
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transverse cutoff. Therefore, by demonstrating the existence of the 
lowest order mode we show that an infinite number of these anomalous 
modes may propagate simultaneously. These modes are, however, 
bound very tightly as surface waves to the side walls of the guide because 
of their strong transverse cutoff. The medium is therefore used in a 
very inefficient manner and high loss results, the loss increasing with 
mode number. 

The higher propagating rectangular waveguide modes have an ana- 
logue in the higher propagating modes in a ferrite filled circular wave- 
guide. This analogue occurs in terms of the integral number of peripheral 
variations. We find, similarly, an infinite number of such propagating 
modes each one corresponding to a given polarization sense and having 
a given number of peripheral variations. The reservations on practical 
transmission still hold in the same manner as in the rectangular case. 

In the course of preparing this publication it was brought to the 
author’s attention that Mikaelyan‘ employed an analysis similar, in part, 
to that developed here. It is felt, in the present analysis, that the physical 
results are made more readily evident by a consideration of the limiting 
case of small guides, with large ratios of width to height in the case of 
rectangular waveguides. The choice of such large ratios is made to 
simplify analyses involving imaginary values of kz and hk,» , wherein 
the wave is considered to be bound to one wall of the guide and reflec- 
tions from the opposite wall are of negligible amplitudes. 


Il. ANALYSIS OF TRANSVERSELY MAGNETIZED FERRITE IN RECTANGULAR 
GUIDE 


The character of the ferrite medium is introduced through the Polder 
permeability tensor: 


The quantities « and « relate to the self and inductive permeabilities 
transverse to the z axis. The relative permeability along the z axis is 
given as unity. These permeabilities may be expressed as follows in 
gaussian units.’ 


(2a) 


(2b) 


yp O (1) 
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y = 2.80 Mc/sec/oersted 
wo = yHy 
Hy = Internal de magnetic field 
= Saturation magnetization 
Maxwell’s equations are given as: 
Curl H = weE (3a) 
Curl E = —iwpoT'-H (3b) 


Assuming a plane wave of dependence e%“*-**®), and appropriately 
combining (3a) and (3b), we have, 


[kk — KI + wepeT)-H = 0 (4) 


The operator in square brackets is a dyadic which may be repre- 
sented in matrix form. The quantity / is the idemfactor, having a unit 
diagonal representation. If we are to require that a non-trivial field H 
exist, the determinant of the operator in (4) must vanish. Since all rays 
traveling perpendicularly to the magnetizing axis are equivalent the 
medium is degenerate in the transverse plane, and some simplification 
is achieved in causing R to lie in the yz plane and letting k, = 0. Some 
further simplification is achieved in normalizing the Polder tensor such 
that 

fig 0 
—ig f 0 (5) 
0 


The following secular equation is then formed. 
—k +f ig 0 
—ig +f kk 
0 


Introducing the substitution p = k./k’ , and recognizing that 


k.? k2 p 
Cy Se 


we have upon expanding (6), 
— + ke? — fh — 
+ pl(h — — + fh — + kf = 0. 


| 
| 
- 


414 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1957 
We note, in general, two solutions in p corresponding to each value of 
hk, , indicating birefringence of the medium. In particular k, must be 
non-vanishing for birefringence to occur or, stated alternatively, rf 
field gradients must exist parallel to the applied magnetic field to obtain 
birefringence. 

The characteristic vector solutions of (4) may be expressed for each 
solution of (7); they are the magnetic fields, 


—i(kyytkz2z) ) 
i(kyytk, (8) 


| 


| 


The sign indeterminacy above is defined with respect to the ratio 


i, /k., the upper sign being given by the positive value of this ratio. 
oS . 

We shall analyze the rectangular waveguide by first seeking parallel 
plane solutions and then utilizing these solutions to form those of the 


rectangular guide. We choose as parallel planes those perpendicular to 
the applied magnetic field, or z direction and having a separation b. 
Because of the absolute uniformity of this type of structure, the field 


configurations as a function of the coordinates transverse to the magnetic 


field, « and y, may change only by a uniform phase factor. Again, the 


choice of transverse axes is made such that these phase variations oecur 
only along y. 
From (7) we would find that a specification of /, leads to a quadratic 


equation in p, with an appropriate consequent multiplicity in hk. Let 


us define as a partial wave any standing wave in the z direction corre- 


| 
p 
| ) 
| 
. 
is 


WAVEGUIDE MODES IN GYROMAGNETIC MEDIA 415 


sponding to some linear combination of the positive and negative values 
of k, for one of the values of kZ2. Examination of (9) reveals that the 
ratio of £, to F, , the field components tangent to the bounding walls, 
to be independent of the sign of k,. Hence, each partial wave has an 
individual value of this ratio irrespective of its standing wave distri- 
bution in the z direction. It is thus impossible, in general, to provide 
a mutual cancellation of two or more partial waves at the electric 
walls by combinations of such partial waves, with the consequence that 
‘ach partial wave must individually satisfy the boundary requirement. 
We find, then, that each partial wave takes on the familiar condition 
k, = ma/b. 

The parallel plane waves now will be appropriately oriented and 
superposed to satisfy the side wall boundary conditions in the rectangular 
guide. Since, as shown in Fig. 2, mutual cancellation is required on the 
side walls of the rectangular guide, the rate of vertical variation must 
be identical for all the component parallel plane waves; thus m is a 
constant of the waveguide mode and k, is uniquely specified. 

Two essential characteristics thus define a rectangular waveguide mode 
in a transversely magnetized, ferrite filled, medium, 

1. The modes are ordered by integral values of m in the relationship 


z 


k, = ma/b. 

2. The propagation constant /, is uniquely specified. 

Standing waves may now be formed in the z direction satisfying electric 
boundary conditions at the parallel planes. Each partial wave of the 
electric field may then be expressed as follows corresponding to its 
appropriate value of p: 


—i(ma/b) (1—p/ p) 4, 


it? 


b 


Let us now specialize our analysis to the small guide case. The require- 
ment of birefringence to produce small guide propagation demands 
that k&, be non-vanishing and that m take on an integral value of unity 
or greater. We have, from (7), the two limiting values of p corresponding 


= — | b 
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to a small value of b, 


Pr 


2 f —h ke 1 
—fh-¢ 2 (11b) 


Disearding the z dependence in equation (10) and dropping a constant 
multiplier, the two characteristic electric field solutions become: 


— 


1) | (ma/b) 
E’ =|. (12) 


(15 


Equations (12) and (13) are parallel plane solutions obtained for 
some arbitrary direction, y, transverse to the magnetic field. This direc- 


tion need not be intrinsically real; mathematically, it simply satisfies 
Maxwell’s equations. We may transform to a desired waveguide frame 
of reference by rotations ¢g; and ¢g2, corresponding to p; and ps, about 
the z axis, where these rotations may possibly be made through complex 
angles. We then have for the electric fields in the new space: 


K 


( | 1) 


EY 


1 
(? sin gi + 2 COS | 
| K 


) 


(i SIN ¢» 


E™ — |i 
li J 


The new y axis of the transformed coordinates is now considered the 


(mr/b) 
(15) 


longitudinal axis of the waveguide. 
The partial wave fields of (14) and (15) may be joined to form a single 


= 
a 
| 
} 
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mode by equating the propagation constant. Therefore, 

COS ¢2 COS (16) 
where cos ¢2 is imaginary for propagation. Propagation may therefore 
occur for > 0 and cos imaginary and /or, < 0 and cos real. 

Boundary conditions require /, and F, to vanish at both guide side 
walls. Four equations result which may be satisfied, in turn, by a super- 


position of four transverse waves involving kr. , and —k,, 


corresponding to values +¢),2. For » > 0 both of the birefringent rays 
have transverse decay. Since the magnitudes of k,,, are large in small 
size guide (see Introduction) boundary conditions need be satisfied for 
practical purposes at only a single wall. We are then left with the sim- 
plification of only two equations in two unknowns. 
Setting « = O1in (14) and (15) and taking equation (16) into account, 
we have the boundary conditions 
(l — p) 


sing: + cos | + Blin cos = 0 
K 


With the result that 
cot gi = 


Choosing fk, positive real, k,, is positive imaginary for « positive and 

negative imaginary for « negative. The rf field therefore hugs the right 

wall for x > O and the left for « < 0, or, alternatively, switches sides in 

the change from a forward to backward direction of propagation. 
Equation (19) may be written equivalently as 


(20) 


Propagation, occurring for imaginary values of cos ¢ and un > 0, is ob- 


tained for < 
Let us now analyze, the possibility of small guide propagation for 
uw < 0. We find, from (16), that cos g; is real for this case. Two cases 
arise; the first for which | cos g | < 1 and the second for the reverse 
situation. 
Let us first consider the case of | cos g | < 1. From (14), k., is real 
whereas from (15) /,, is imaginary. Let us associate wave amplitudes 


$17 
‘ 
(19) 
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with x dependences as follows: 


Be*='* 


Ce or 


where a is the guide width. Let us assume that k,, is a sufficiently large 
imaginary quantity of such sign that 


g |] 


This assumption will be seen to be consistent with the solution. [(26b) 
for small size guide.] 

Setting up the boundary conditions for FL, and F, at « = 0, we have 
from (14), (15), and (16), 


—1\. 
(A — B) ) sin gi + (A + B) cose, + cosg, = 0 (21a) 
K 
(A+ (21b) 
let r = B/A. Combining these last two equations we have 
l-—r kx 
(22) 
Satisfying the boundary conditions at « = a produces an equation 


similar to (22) with the substitution 


r— re i2k re. 


Thus 


1 me. = 1 re (23) 
Equation (23) is satisfied by the condition \ = mm. Since kz, = 


i(mm/b)u sin , we have 


(24) 


The assumption that cos ¢: is real and less, in magnitude, than unity is 
realized by the condition 


(25) 


ys: 
A 
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Only in the limiting condition of a infinitely greater than b do all modes 
(m, n) propagate in the negative region of uw. In this particular case, 
sin g: = 0 and we find from (21a) and (21b) that C = D = 0. Thus we 
find a situation in which the guide boundary conditions are satisfied by 
but a single class rays of the two classes available. 

This result is entirely comprehensible if we observe the wave number 
relationship obeyed by Rk; and ky. Employing the definition of p which 
states that k° = k.’/p, and using (11a) and (11b), we have 


ke, key, = 0 (26a) 
key +hy +k: =0 (26b) 


As stated in the Introduction, it is an entirely consistent procedure to 
satisfy boundary requirements with real wave numbers over the negative 
range of w using the class of rays indicated for (26a) above. 

More generally, (25) shows a complex relationship of the ordering 
of propagation modes by n and m, for finite a, for a given negative value 
of uw. In contrast to the uw > O case, propagation may possibly not occur 
for a range of lower order integral values of m. As uw becomes increasingly 
large in magnitude, m must likewise take on increasingly higher values 
for transmission to occur. 

The case of cos ¢g, real and greater, in magnitude, than unity, leads 
to trivial result. Both partial waves have imaginary values of /, , for 
this case, and the far wall receives essentially no coupling. Analysis 
simply repeats the result of (20) and we find that | > «| andy < 0. 
If the Polder tensor components given in (2a) and (2b) are plotted (see 
Fig. 5). We find that this last set of inequalities form an impossible 
combination. 

Summarizing we find that a rectangular waveguide of any dimension 
(and, in particular a guide of arbitrarily small dimensions), filled with a 
lossless transversely magnetized ferrite medium, will support an infinite 
number of freely propagating modes at any frequency for which |u| < 

x |. The character of these modes differs considerably in the two regions 
of w < Oand uw > O and somewhat different viewpoints of propagation 
must be taken. We shall find similar results relating to the longitudinally 
magnetized ferrite filled circular waveguide in the following section. 


Ul. ANALYSIS 


GUIDE 


OF LONGITUDINALLY 


MAGNETIZED 


FERRITE IN) CIRCULAR 


We now proceed to a second structural geometry in which an anoma- 
lous behavior occurs attributable to the birefringence of the medium. 


: 
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This is the circular guide which has been the subject of considerable 
analysis by Suhl and Walker. It is instructive, however, to repeat the 
analysis of this case, in the small guide limit, showing more pointedly its 
behavior from the viewpoint of combinations of the two types of waves 
in the medium. 

The character of transmission in undersized circular waveguide is 
very similar to that of the undersized rectangular case. We may demon- 
strate the physical significance of this statement by the following argu- 
ment. The excitation in a rectangular waveguide, for |u| < |«{ and 
un > O, is essentially that of a surface wave bound very tightly to a 
single wall. Considering this wall alone, which may now be extended to 
arbitrary dimensions but with k, kept large, it may be wrapped upon 
itself either about the magnetic field as an axis or containing the mag- 
netic field peripherally. In either event, the wrapped guide must start 
and terminate at the same phase, requiring a multiplicity of 27 around 
the circumference, and the wave must thus continue to have a large 
k, value. Considering the large value of /, and the state of excitation of 
the ferrite, the small circular guide may propagate. 

Analysis will demonstrate that propagation also takes place in the 
region uw < 0. The quantity k,, is real and /,, imaginary, see (26), leading 
to a case essentially similar to that of the rectangular waveguide. The 
analogy is appropriate to the case of b/a of finite value for which the 
rectangular guide requires the appearance of both refractions. We now 
proceed to obtain the field solutions for the circular guide. 

Referring to (9) for the plane wave solution of the electric field, let 
us define to within a constant multiplier. 


| 


where, for the case of large k, (9, 12, 13) 


We shall consider here, of the two possible wrapped-wall structures, 
that case in which the magnetic field is applied axially as shown in Fig. 3. 
Referring to Fig. 4, the cylindrical drical electric wave satisfying Max- 
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Fig. 4 — Transformation to polar coordinate frame. 
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well’s equations and the boundary conditions for this structure is ob- 
tained by integrating plane waves of the form of (27) traveling at all 
possible angles y, the integration being subject to a weighting factor 
G(y) to obtain the most general field. The coordinates (r, ¢) refer to the 
physical system and the coordinate y identifies a plane wave traveling 
along a particular y axis. We have thus in an (r, ¢, z) coordinate frame: 


cos¢ sing 0) (iE, ) 


E = Gy) |} — sing € (28) 


0 


Recognizing that 


dé 


y=rsin¢g 


and = G(t — ¢) 


an integration results over the variable ¢. Because of the uniqueness of 
the field as a function of ¢g, the only term containing ¢g, G(f — ¢), must 
be a periodic function in its argument. A typical mode is formed by 
choosing one of the terms of the Fourier series of G(¢ — ¢), namely 


ein 


We find from (28) that 


+ Eyl 
p 


(p) E, - J 
| p 


E.J 


where E,, is that partial expansion of the total field BE, corresponding to 
the number of angular variation n, and p = ir. There are two values of 


p corresponding to the two values of /, , and each leads to a partial wave. 
Let A and B be the respective partial wave amplitude; satisfying the 
boundary conditions on /, and E,, we have from (29): 


1—y\,, A 
A ( - — J — Jn(p2) = 0 (30a) 
K pi 


Au +- BJ ,,(p») = () (30b) 


where p; and p» are defined for r = PR, the radius of the evlinder. Recog- 
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nizing that pp = uw Se we have from (30) 


J n(pr) 


Pl 


= (0* (31) 


+n 


where = IR. 
Equation (21) may be modified by a recurrence relationship to become 


n J,(p1) 

For « > 0 the quantity p: is a pure imaginary for large real values of 

i:, . Since the n“” order Bessel function is monotonic in imaginary argu- 

ments and possesses the multiplier (7)", the right-hand side is negative 

for n positive. For n > 0, propagation occurs for 


> 
sgnk = —sgn 


Inspection of (31) reveals that a reversal of the sign of n is equivalent 
to reversing the sign of x. This conforms to the physical situation in 
which reversal of the sense of circular polarization is equivalent to the 
reversal of magnetic field. Thus for n < 0 and u > 0, 


l«| > 


sgn = sgn yp 


We find, from the above arguments, that just one sense of circular 
polarization propagates in an undersized circular guide for » > 0 and for 
a given direction of the magnetic field. It will be demonstrated shortly 
that propagation occurs for » < 0, but with an entirely different struc- 
ture of modes. The right-hand side of (32) is monotonic as a function of 
p for » > O, leading to only one solution for each value of n. This will 
not be the case for u < 0. 

It is of interest first, however, to observe the limiting approach to 
uw = 0 in the region of u > 0. The right-hand side of (32) is finite for 
finite imaginary values of p; , so that the only solution as » approaches 
zero is that for which the magnitude of p,; becomes infinitely great. The 
Bessel function is asymptotically expansible as a cosine divided by a 
square root of its argument. Thus 


1 2n+1] (w/4)) —i(py+[2n ( 
J .(p1) = + ¢ i(py+[2 (33) 
Pl 


* Equation (31) may likewise be obtained from the small radius limit in (34) 
of Reference 2. 
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Considering p; to be positive imaginary, as p; — 7%, (32) becomes 


1p) 
n 


Substituting for p; , we have 


(35) 


Thus, as « approaches zero from values greater than zero, the propaga- 
tion constant tends to become singular. Physically, however, u does not 
vanish but approaches a small imaginary value caused by ferrite losses. 
The propagation constant /. becomes complex and takes on a large 
imaginary component, signifying large guide attenuation. Since these 
losses occur in the limited neighborhood of « = 0, we may construe this 
waveguide behavior as corresponding to a system resonance. 

In the region «7 < 0, p; becomes real while p: remains imaginary. The 
right-hand side of (32) is now composed of only real arguments. Since the 
zeros of different order Bessel functions alternate, the right side of (32) 
contains a succession of poles and zeros, leading to an infinite number of 
branches with each containing a solution p; to the equation. Thus there 
are an infinite number of propagating modes corresponding to each value 


Fig. 5 — Frequency characteristics of Polder tensor components. 


(3: 
2 
a: 
; 
— 
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of n, in marked contrast to the case of w > 0. The solutions remain identi- 
cal, as before, if both « and n are simultaneously reversed in sign, but 
differ if only one of the two quantities is reversed. 

Since = is a branch point, the limiting condition as approaches 
zero for values w < 0 differs from that for the reverse case. Equation (32) 
is now satisfied in the limit of small « by the real zeros of /,,(p,). Since 


these roots are finite, ’. , equal to —(—wp)’p,/R, tends towards zero for 
p 


all modes. Since the formulae developed in this paper always presume 
large wave numbers, we may infer a vanishing value of /. to simply 
represent a value which is small relative to the reciprocal of the wave- 
guide radius. In any event, /, is no longer singular at « = 0, and there 
is no resonance in the approach from negative values of wu. 

In sum, the features of the circular guide strongly resemble those of 
the rectangular guide in the region of «7 > 0. This was to be anticipated 
by the “wrapped wall” construction where the wave is tightly bound 
to the wall. The wrapped equivalences do not hold in the region u < 0 
since, with harmonic transverse dependence, the wave is no longer 
bound to the wall. This lack of equivalence is manifested in the matter 
of ordering modes. lor a rectangular waveguide of finite aspect ratio, we 
find from (25) that there are but a finite number of modes corresponding 
to each value of m for «7 < 0. The circular guide differs in providing an 
infinite number of modes corresponding to each value of n. Further, 
whereas the circular guide covers the entire range of uw) << «|, (25) in- 
dicates that the various modes of the rectangular guide covers a more 
restricted range determined by the guide aspect ratio. 


IV. CONCLUSIONS 


The waveguide behavior analyzed in this paper has been experi- 
mentally observed? and good correlation has been obtained. From the 
viewpoint expressed of forming a guide cross-section by wrapping a wall 
to which a surface wave is bound, we may anticipate that the unusual 
behavior observed in the two types of guides examined is probably 
characteristic of many other structures. 

It is not clear, at this time, if the complete set of modes of either the 
rectangular or circular guides have been exhausted. We already observe 
that an infinite number of modes propagate simultaneously so that 
scattering problems become considerably more complex than in the usual 
‘ases. It is felt by the author that the field of waveguide analysis calls 
for new methods and techniques of modal synthesis when ferrite loaded 
structures are considered. 
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Measurement of Dielectric and Magnetic 
Properties of Ferromagnetic Materials 
at Microwave 'requencies 


By WILHELM VON AULOCK and JOHN H. ROWEN 


(Manuscript received August 15, 1956) 


Some experimental techniques are discussed which permit measurement 
of the magnetic and dielectric properties of ferrite materials in the micro- 
wave region by observing the perturbation in a cylindrical cavity due to 
insertion of a small ferrite sample, A comparison of the properties of thin 
disc samples with those of small spheres shows that dises yield more accurate 
results in the region below ferromagnetic resonance whereas spheres are pre- 
ferable for the study of ferrite properties near resonance. A short description 
of instrumentation for cavity measurements at 9,200 me is given and experi- 
mental results of disc measurements are reported for a low-loss BTL ferrite 
and several disc diameters. A comparison of experimental results with 
Polder’s theory indicates that the loss of polycrystalline ferrites below reson- 
ance is considerably lower than that predicted from an evaluation of the 
width of the resonance absorption line. 


lL. INTRODUCTION 


The dielectric and magnetic properties of semi-conducting ferromag- 
netic materials such as ferrites have been the subject of intense study 
in recent years. Analytical expressions for the components u and « of 
the permeability tensor of a loss-free single-crystal ferrite were derived 
by Polder. These expressions were later modified to include a loss factor 
a.2:3 Yager and others measured the resonance absorption of single 
crystals of nickel ferrite and found very good agreement with theory 
provided the loss factor a was determined from the width of the meas- 
ured resonance absorption line. However, when Artman and Tannen- 
wald® measured the real and imaginary parts of w and « for polyerystal- 
line ferrites they found that agreement with theory was somewhat less 
than perfect if @ was also determined from the measured line width, 
Discrepancies were observed for both real and imaginary parts of w + 
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x in the region below resonance because the effects of polyerystalline 
structure and anisotropy forces were neglected in Polder’s and Hogan’s? 
analysis. Furthermore, it was assumed in the derivation of the perme- 
ability tensor that the ferrite is saturated with a biasing de magnetic 
field which is large compared to the microwave magnetic field. 

There exists a great need for experimental data for all those conditions 
where some of the above assumptions do not hold. In particular, the 
region of biasing magnetization between zero and ferromagnetic reso- 
nance is of interest because it is the operating region for many ferrite 
devices such as phase shifters, modulators, and field displacement iso- 
lators. Techniques for the measurement of ferrite parameters below 
resonance were investigated and it was found that the measurement of 
the perturbation of a degenerate cylindrical cavity by a thin ferrite disc 
yielded accurate results, whereas observation of the cavity perturbation 
caused by a small sphere produced less accurate data. 

It is the purpose of this paper to deseribe and discuss the thin dise 
method and to compare it with other techniques described in the litera- 
ture.®» 7 After defining the ferrite parameters as constants in Maxwell’s 
equations it is shown how these parameters can be obtained from 
various measuring techniques. Instrumentation for the thin dise tech- 
nique is described and a few remarks are made pertaining to experi- 
mental difficulties. Finally, some measurements of low-loss ferrites are 
reported and compared with values predicted by Polder’s relations. 


2. DESCRIPTION OF FERRITE PARAMETERS 


It is customary® to define the electric and magnetic polarization vec- 
tors P and M in terms of the field vectors # (electric field intensity), 
D (electric displacement), H (magnetic field intensity), and B (mag- 
netic induction). In the M.K.S. system we have: 

P= D — 

M = B/w — 
& = 8.854 x 10” farad/meter, permittivity of free space 
uo = 4r X 10° henry/meter, permeability of free space 
Then, the intrinsic parameters of a ferrite medium are defined as those 
quantities which relate P and M to the electric and magnetic fields in 
the medium respectively. 
P 
M 


Whereas the electric susceptibility x, is a scalar quantity in ferrites the 


; 
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magnetic susceptibility x, is known to have tensor properties. Assum- 
ing that the static magnetic field H, is in the z-direction we have 

0 

(1) 

0 0 

If we restrict ourselves to sinusoidal time variation of the RF fields we 

may describe electric and magnetic losses in the ferrite by regarding 

Xe, Xm, and « as complex quantities: 


| 
| 


/ 
K 
Thus, it is seen that the RF properties of a ferrite medium regardless of 
geometry are completely described by six “intrinsic”? parameters, 
Xm", and x”. The dielectric constant and permeability 
‘u of the material are obtained from 
+ ! 
where 1 is the unit matrix. 

It is the objective of the measurement to obtain each of the above 
parameters as a function of one or more variables of interest such as 
frequency, saturation magnetization, static magnetic field, temperature, 
applied power and others. Measurements may be made on single ferrite 
crystals — mostly for research purposes — or on polycrystalline ma- 
terial for many purposes in connection with the development of ferrite 
materials and devices. These measurements are generally compared to 
the behavior of x, and « as predicted by Polder’s equations. One ob- 

tains from the equation of motion of magnetization 


_ 
YH? w 


T ‘ r . . 
Using Suhl and Walker’s’ notation this may be written as 


Xm =u—-1l= 


Xm 


| 
7 
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where p = |y | M./w normalized saturation magnetization 


o = |y| H./w normalized static magnetic field in the ferrite 
y | = 2.8 me per oersted, gyromagnetic ratio 
w = operating frequency 
It appears that some cavity techniques measure the eigenvalues of (1), 


Xm + x and x» — x, directly, which are seen to be 


Xm = (2) 


Suhl and Walker show that a loss term may be introduced by replacing 
a by o + ja(sgn p) in (2).* Separating real and imaginary parts we get 
(3) 
(ao + 1)? +. 


ap(sgn p) 


m + ‘ 


For the determination of a from measurements it is convenient to define 
a loss tangent 


5, aXm _ alsgn p) (5) 


Xm + «’ 
Typical curves for xm + « and 6, assuming p = 0.5 anda = 5 X 10° 
are shown on Figure 1. For the purpose of describing and comparing 
experimental results, it may be convenient to distinguish among various 
regions of H/, as indicated on the graph because a different measurement 
technique may be required for accurate measurements in each region. 


3. METHODS FOR MEASURING MAGNETIC PROPERTIES 


Three measurement methods have been reported in the literature all 
of which employ the detuning and change in 1/Q of a resonant cavity by 
a small ferrite sample. Van Trier’ used very thin long cylindrical samples 
in a coaxial cavity. Artman and Tannenwald’ employed small spheres, 
and we used thin dises® both placed close to the endwall of a cylindrical 
degenerate cavity excited by a TE: mode (Figs. 2 and 3). Recently, 
Berk and Lengyel” suggested the use of a cylindrical post at the center 
~ * By definition sgn p = +1 for p > Oand sgn p = —1 for p < 0. 

t Since it is customary to use x, + « for the designation of the resonance line 
this notation has been used here. Consequently, p and ¢ should be assumed nega- 
tive. 


1 
+ } 
t 
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NORMALIZED STATIC MAGNETIC FIELD, o = i 


Fig. 1 — Theoretical values of x, + « and loss tangent 6: versus normalized 
static field o. 


of a degenerate rectangular cavity.* In principle, all these methods per- 
mit the determination of the six parameters x.’, Xe”, Xm’, Xm”, x’, and x”. 
However, in practical applications there are significant differences, e.g., 

* As this paper was being written another variation of the thin cylinder tech- 


nique using the TMiio mode in a circular cavity was reported by Spencer and 
LeCraw at the IL.R.E. Convention, New York, Mar. 21, 1956. 


6 
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between the use of a spherical sample and a thin disc, such as the per- 
turbing effect on the cavity field, the accuracy of small loss measure- 
ment, the occurrence of resonance at a static field where the intrinsic 
parameters are not at resonance,” and the possibility of making accurate 
measurements of the electric susceptibility. Therefore, the sphere method 
and the dise method will be reviewed briefly and an attempt will be 
made to compare their capabilities. It is hoped that this comparison will 
also be helpful for the evaluation of other measuring techniques not 
covered in this paper. 


3.1 The Small Sphere Method 


In order to appreciate the significance of the quantities measured with 
the small sphere method it is expedient to define an effective suscep- 
tibility tensor x,,. by relating the magnetization vector* to the applied 


Fig. 2 — Degenerate TE), cylindrical cavity with ferrite sphere. 


Fig. 3-— Degenerate TE. cylindrical cavity with ferrite dise. 


*R. A. Waldron (Institute of Electrical Engineers, Convention Oct. 29 to 


Nov. 2, on Ferrites, London, 1956) related the magnetic induction vector B in a 


¢ 
: 
: 
> 
; 
2 
/ 
z 
iat 


MEASUREMENT AND MAGNETIC PROPEKTIES 433 


OF 


DIELECTRIC 


field 


M = 


and observing that the RF components of MW and H° (denoted by lower 
case letters) are related in a cylindrical coordinate system by 


0 . 0 


(6) 
me = + Xmshor 


Placing the small ferrite sphere close to the endwall of the cavity (Fig. 2) 
and observing the splitting of the resonance into two frequencies w, 


(related to the positive and negative circularly polarized modes) and the 
two changes in 1/Q of the cavity after application of a static magnetic 
field in the axial direction leads to two measurable quantities 


Aw, = wo — wy frequency shift 


(1,/Q)4 1/Q: — 1/Qo change in internal Q of the cavity 


Where wy and Qo are resonance frequency and Q of the empty cavity. 


It can be shown that real and imaginary part of x. and «, can be ob- 
tained 


= 0.6982>° . (xXme’ Ks’) (7) 
Wo 
A(1/ Q)4 — 0.6982 do (Xme”” + Kg ) (S) 


The quantities d), ), and L are the sphere diameter, cavity diameter 
and length respectively, A» is the wavelength in free space associated with 
» . In order to obtain the intrinsic parameters x» and «x from (7) and 
(8) one may use the relationships” 


W=H+M/3 (9) 
k) 
ms + ‘3 10) 


- - 
sphere to the applied field by writing ~ B = u,H° where ps may be designated the 


external relative permeability tensor. ‘ can be readily shown ths at Waldron’s re 


sults are in agreement with ours if one notes that (2 3)xme = = Me — 1. We found 
the at the use of the effective susceptibility tensor Xms is much to be preferred over 
i, because it simplifies notation and interpretation of experimental results in terms 
of the intrinsic quantities x, and x. 

t Equations (7) and (8) are identieal to Artman and Tannenwald’s® expressions, 
if + 22D2/L2) is substituted for Ao?. 
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Equation (10) has a pole at xm + « = —3 which simply indicates the 

resonance condition for a sphere as derived by Kittel.” This can be 
verified from (2): 

Xm EKK= (11) 

Note that p and o are either both positive or both negative, hence, only 
one of the two quantities (ym + «) or (xm — «) goes through resonance 
at |o|) = 1. A similar situation exists for x». -— «x. expressed in terms 
of (11) 


op 

Xme = (12) 
One of the two quantities (xm. + «») goes through resonance at 

lols (13) 

Observing that the field in the sphere is given by (9) this may be written 


as 


ly | 


(KNittel’s resonance frequency of a ferrite sphere) 


It is easily seen that this resonance of the spherical sample makes the 

evaluation of x,, and «x from (10) rather unattractive because one would 

expect inaccurate results for x,, and « in the vicinity of the sphere reso- 

nance o,. Furthermore, for all numerical computations (10) must be 

separated into real and imaginary parts 

3 3) Cem! + Xm” + «”)’ (15) 
(xm + 38)? + x”)? 


+3)? + (xm” tx”)? 
In general higher order terms of x,” and x” may be neglected, but in the 
vicinity of sphere resonance these terms predominate as the term 
Xm + x’ + 3 vanishes. It can be seen from the preceding discussion 
that the determination of ym’, xm”, «’, and «” from xm and «x, has its 
difficulties. Fortunately, there is an easier way to the interpretation of 
Xms and x, in terms of the intrinsic parameters x» and x. We use (9) to 
define a new quantity o’ in terms of the applied magnetie field. 


Xme Ke’ 


= 9 (16) 


o =o+ p/3 = (17) 


y 
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Now (12) may be written in terms of the normalized applied static field 


Ss 


yp 
Xme Ks = (18) 


Comparison with (11) shows that the functional dependency of the 
effective parameters x,,, and x, on the applied field H’, is identical to 
Polder’s relations for x,, + «. This permits plotting xn». + «. versus /°, 
and relabeling the coordinates x, + « and H.,, provided that Polder’s 
equations hold exactly. It would appear however that one of the reasons 


for measuring ferrite parameters is the fact that Polder’s equations are 


known to be approximations which do not always hold and which are 


subject to many restrictions as pointed out earlier in this paper. Sum- 


ming up, one may say that measurements of the parameters of a small 


ferrite sphere lead to excellent results in terms of effective quantities 
Xms and «x, as a function of the applied static field, but may only be 
approximations when interpreted in terms of the intrinsic parameters 


Xm and x. 


3.2 The Thin Dise Method 


The use of a thin dise rather than a small sphere as a cavity perturba- 
tion eliminates most of the difficulties enumerated in the previous sec- 
tion because the intrinsic parameters x, + «x are measured directly. 
Placing a thin dise against the endwall of a cylindrical TE: mode 
cavity (Fig. 3) and observing the splitting of the resonance frequency 
and change in 1/Q as before one obtains the following relationships 


(see Appendix for derivation) 


dws 


= (xm Ry + x Ro) (19) 
Wy 

A (xm’ Ry + (20) 
+ 


The quantity ¢ denotes the thickness of the dise, and R; and R» are fune- 
tions of the geometry which take into account that the RF magnetic 
field is not constant over the face of the dise. A plot of R; and Re versus 
the ratio of dise diameter to cavity diameter (Fig. 4) shows that the 
functions are closely equal for dise diameters less than }). This implies 


circular polarization of the magnetic field in this region whereas the 


field becomes elliptically polarized as one approaches the outer diameter 
of the cavity. It might be argued that the dise should be small enough 
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1.0 


ie 


04 0.6 
do /D 


Fig. 4 — The functions Rk; and FR, versus the ratio of dise diameter dy to cavity 
diameter D. 


to be entirely within the circularly polarized region. However, this re- 
quirement appears to assume that there is some interaction, such as 
spinwave coupling, between adjacent regions in the disc. It is possible 
that such interaction exists in single crystals and leads to multiple 
resonance effects. However in polycrystalline material it is safe to assume 
that each crystallite reacts independently with the RF magnetic field, 
enabling us to sum these effects by simple integration. 

This integration has been performed in the derivation of (19) and (20) 
under the assumption that the dise is thin enough to introduce only a 
first order perturbation into the cavity field. It will be shown presently 
that these assumptions are consistent with experimental results. 

The measured effects Aw, and A(1/Q), depend on the volume of the 
perturbing body, hence one would expect that these effects are at least 
an order of magnitude larger for thin discs than for small spheres. This 
permits very accurate measurement of ferrite parameters in the region 
below saturation and below and above resonance. In particular, the loss 
parameters xm” , «” of modern low-loss ferrites can be determined ac- 


curately in these regions. However, in the resonance region, the meas- 
ured effects become so large that measurement becomes difficult. Hence, 


Ra 

| 

| 
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we do not recommend this method for measurement of resonance line- 
width. The greatest advantage of the thin dise method lies in its ability 
to explore the region below resonance where many ferrite devices operate. 
It will be noted that this region is wider for the dise than for a small 
sphere or for a thin cylinder magnetized normal to its axis. Using Kittel’s 
relations we find the field in the ferrite at which resonance occurs for a 
given saturation magnetization and operating frequency 


M 
ly| ly| 3 


Thin Dise Small Sphere Thin Cylinder 


(21) 


It is seen that for some materials a small sphere or a thin cylinder may 
be at resonance even before it is saturated. 


4. METHODS FOR MEASURING THE DIELECTRIC PROPERTIES 


In principle the electric susceptibility x. may be measured with any 
of the three sample shapes discussed above provided the sample is 
placed into a region of maximum electric and vanishing magnetic field, 
e.g., the center of a TEy, mode cylindrical cavity. A simple computation 
shows that a small sphere located at the center of the cavity produces a 
frequency shift and change in 1/Q as follows: 


Aw Xe dy’ 
Wo ° Xe + PL 


3 
Xe do 
A(1/Q) = 25.136 - 
/ (xe + 3)? 
In actual measurements it is found that the dielectric constant is not 
entirely independent of the diameter of the sphere, and that the measure- 
ment of the loss factor x,” is difficult because the change in 1/Q is very 
small. An additional inherent difficulty of this method is the fact that 
(22) is rather insensitive to large values of x. as are frequently encoun- 
tered with ferrites. Again the thin dise method permits greater ease of 
measurement, larger effects, and a simpler relationship between the ob- 
served quantities and x.. We find (see Appendix A) 
Aw 
¥o = R, (24) 
Wo L 

t 


A(1/Q) 2x. 7 R; 


: 

: 

: 

(29) 
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5. INSTRUMENTATION 


Our measurement technique has been influenced by a number of prac- 
tical considerations including the need for determining accurately and 
quickly a figure of merit for a large number of different ferrite materials. 
In particular, a variety of low loss materials has become available in 
experimental quantities requiring a precise technique for measuring 
small loss factors below resonance as a guide for further ferrite develop- 
ment. Therefore we were faced with the problem to develop an instru- 
mentation capable of measuring these small loss factors but simple 
enough to be operated without detailed knowledge of microwave tech- 
niques. Fortunately the use of thin discs permits us to introduce a fairly 
large volume of ferrite into the cavity without violating the basic as- 
sumption of a small perturbation. As a consequence frequency shifts of 
the order of 10 me are obtained at static magnetic fields just sufficient 
to saturate the material. Thus the quantities x»’ and «’ may be measured 
without difficulty in the regions below and above resonance. 

The thickness of the dise should be chosen to attain an aspect ratio 
(diameter/thickness) of 50 or larger. Dises of 0.005 to 0.007-inch thick- 
ness were employed in actual measurements at 9200 me. For a typical 
measurement of a 0.005-inch dise at 9200 mc, a change of 1/Q of about 
2 per cent corresponds to a loss term xm” + «” = 2 X 10-5. Measure- 
ment of Q with a reproducibility of 1 per cent has been accomplished 
initially by careful work, and it has been our objective to maintain this 
accuracy in routine measurements by semiskilled operators. This re- 
quired the use of rather elaborate circuitry for the precise measurement 
of the changes in 1/Q of the cavity. Although most of these techniques 
have been used before in the field of microwave spectroscopy we hope 
that the description of this instrumentation will be of interest. 

Fig. 5 shows a block diagram of the circuit. A klystron Type V58 
(Varian Associates) is swept through a frequency band of about 80 me 
at X-band. The resulting signal with a center frequency of 9,200 me is 
used to excite a TE, mode cylindrical cavity. Incident and reflected 
signals are separated by means of directional couplers and displayed on 
an oscilloscope. Both signals can be aligned with the aid of a shorting 
gate and a precision attenuator in front of the cavity. The reflected signal 
shows clearly the cavity resonance which splits into two if a ferrite disc 
is placed against the endwall of the cavity and magnetized along the 
cavity axis. 

One of the major problems of the measurement is the accurate deter- 
mination of these new cavity resonance frequencies and of the line width 
of the displayed resonance curves between half-power points. In the 


wy 
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solution of this problem, more than ordinary emphasis was placed on 
ease of operation and elimination of ambiguities in the frequeney de- 
termination. The resulting instrumentation uses as a stable reference 
frequency a crystal-controlled oscillator and a frequency multiplier which 
vields three reference frequencies; 55.8, 223.3 and 4,020 me. These are 
mixed in a crystal harmonic generator and mixer leading to a line spee- 
trum of numerous reference frequencies with a constant spacing of 55.8 
me. The same crystal mixer produces a beat frequency signal between 
the incident signal from the klystron and each of these reference fre- 
quencies. A communication receiver with modified IF stage permits 
selection of a frequency marker out of these beat frequency signals. This 
marker appears as a blank spot on the traces of incident and reflected 
signal, and can be moved to any desired point by simply changing the 


frequency setting of the receiver. Since the receiver dial cannot be read 
with great accuracy on the high frequency ranges, a frequency counter 
connected with the local oscillator of the receiver permits a reading of 
the oscillator frequency to an accuracy of 1 keps. Noting that the local 
oscillator is 0.455 me removed from the difference signal, one obtains 
the frequency of the difference signal with more than sufficient accuracy. 


6. Results of Measurements 
Transmission- and reflection-type cylindrical cavities have been em- 


ployed for measurements with linearly and circularly polarized excitation 
in the 6,000- and 9,000-me frequency bands. Circular polarization is pre- 


ferable in the region below saturation where frequency shifts are small. 

It is not necessary to choose ferrite discs of relatively small diameter 
for the purpose of staying within the region of circular polarization close 
to the cavity axis. The derivation of x,, + x is not restricted to circularly 
polarized fields, and the result takes into account that the field becomes 
more and more elliptically polarized as one approaches the edge of the 
cavity. This can be seen by rewriting (19) and (20) as follows: 


1 
Xm + woF RR» [Aw4 (R, + Ro) Aw. (Ry R»)| 


+ Re) — AC/Q)-(Ri — R:2)] (26) 


2FR,\R, 


where F = 
The factor (/?; — Re), which is zero for circular polarization, corrects 
the values for y,, + « if the dise extends into the region of elliptical 


: 
A 
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polarization. For relatively small dises (R; = R»), one obtains 


9 


K’ = Aw 
x wl R, * 


(27) 


1 
Xm = FR, A(1/Q)4 


A large number of measurements was made with a half-wave, reflee- 
tion-type cavity and linearly polarized excitation. Some typical results 
will be discussed below to demonstrate the applicability of the dise 
technique. The frequency shift measurements and yx,,’ + «’ for a ferrite 
material of 1,300 oersted saturation magnetization are shown on Fig. 6. 
The ferrite disc has a thickness of 0.0063 inch and completely covers the 
endwall of the cavity. If the field in the cavity were circularly polarized 
throughout, then the frequency shift would vary as x,,’ + «’. However, 
elliptical polarization causes a deviation of the measured curves for 
Aw, from x, + «’. Agreement between theoretical and experimental 
values of ym’ + «’ is good in the regions below and above resonance. 
Measurements in the resonance region are not possible with a dise of 
this size because frequency shift and change in Q are so large that the 
assumption of a small perturbation is violated. In order to establish 
further that measurements of x,’ + x’ are independent of disc diameter 
three dises of the same material (saturation magnetization 1300 oersted) 
with diameters of 0.249, 0.400, and 1.050 inches were measured in the 
above-mentioned cavity. Plots of xm’ and x’ (Fig. 7) indicate good agree- 
ment for «’ and some scattering of values for x,,’. This can be explained 
by noting that the resonance frequency of the empty cavity enters into 
the computation of xm’, but cancels out for «’ (equation 19). Conse- 
quently, a very small change in the length of the cavity, as might be ex- 
pected from opening and reassembling the device, will produce a notice- 
able error in the low-field region. A change in cavity length of 10-4 inch 
will produce a frequency shift of 1 me at an operating frequency of 
10,000 me and introduce an error of the order of 0.02 into the measure- 
ment of xm’. This error may be minimized by using a relatively large disc. 

Dises with a diameter of 0.4 inch yielded good measurements of the 
imaginary quantities x»/’ + «” in the low-field region. A typical result 
for a low-loss ferrite (Fig. 8) shows the measured quantities A(1/Q), 
and the corresponding x»’" + «”” as a function of the applied magnetic 
field. (The internal field in the ferrite is obtained by subtracting the 
magnetization from the applied field). It is noted that only A(1 /Q) 
can be observed in the resonance region, whereas A(1/Q), becomes too 
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Fic. 6 — Evaluation of x’» + «’ from measurements of frequency shift Aws and 
comparison with Polder’s theory. Low-loss BTL ferrite, saturation magnetization 
1300 oersted. 
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DISC DIAMETER 

~=0.249 INCHES 
© = 0.400 INCHES 
4= 1.050 INCHES 


| 


2000 3000 4000 5000 6000 7000 
H°, APPLIED FIELD IN OERSTEDS 


Fic. 7 —- Measurement of xm’ and x’ versus applied static field for three differ- 
ent dises, cut from the same ferrite block. 


large to be measured. Knowledge of A(1/Q)_ is not sufficient to determine 


tn 
A(1/Q),(Ri — Re) (ef. equation 26) becomes comparable to the term 
A(1/Q)_(R, + R») and is needed to compensate for the anomalous peak 
of the A(1/Q)_ curve. The loss parameters x,,’” + «’’ assume values of 
the order of 10-2 below and above resonance. The estimated error of the 
measurement is 3 X 


in the resonance region because here the correction term 


to 
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CHANGE IN (1/Q) 
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2000 3000 4000 5000 6000 
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Fic. 8 — Evaluation of xm'’ + «’’ from measurements of 4(1/Q)+ for low-loss 
BTL ferrite, saturation magnetization 1300 oersted. 


Whereas it is possible to obtain good agreement between measured 
and theoretical curves of xm’ + «’ from Polder’s relations (3), such an 
agreement cannot be obtained for x,’ + «’’ from a comparison with 
(4). This discrepancy is assumed to be caused by the fact that Polder’s 
theory was developed for single ferrite crystals and did not take into 
account the random orientation of crystal axes in polycrystalline ma- 
terials, such as the ferrites used in these measurements. It is reasonable 
to expect a broadening of the resonance line and a departure from the 
Lorentzian shape in polycrystalline ferrites; hence, the expression for 
the absorption line (4) no longer holds. 

Measurement of the electric susceptibility of ferrites did not present 
any major difficulties, provided some care was used in the suspension of 
the discs at the cavity center. 

Summing up these results it may be said that the disc method has 
yielded satisfactory measurements of the intrinsic parameters of poly- 
crystalline ferrites below and above resonance as well as in the un- 
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saturated region. Measurements of single crystals and of resonance 
curves with this method have not yet been made. 

It is anticipated that in both cases smaller and thinner discs would be 
needed than have been available so far. However, it can be hoped that 
these difficulties will be overcome in the near future and that the dise 
method will be useful also for the study of ferromagnetic resonance 
phenomena. 


APPENDIX 


PERTURBATION OF A DEGENERATE CYLINDRICAL CAVITY DUE TO A THIN 
DISC 


The general perturbation equation for a lossless cavity can be derived 
from energy considerations or directly from Maxwell’s equations. We 
obtain for the shift of resonance frequency due to a small perturbation: 


mit de +5 [ 
wo Wo 
[ dv 


resonance frequency of the empty cavity 

resonance frequency of the cavity after insertion of the perturbing 
sample 

magnetic field intensity vector in the empty cavity 

electric field intensity vector in the empty cavity 

volume of sample 

volume of cavity 

indicates the conjugate value 


The denominator in (Al) indicates the total energy W° stored in the 
empty cavity at resonance, whereas the numerator is equal to the addi- 
tional magnetic energy W,,‘° and electric energy W,“? stored in the 
perturbing sample. 

Equation (A1) is valid if the frequency shift is small, 


Aw _ | #1 — wo <1 (A2) 


Wo wo 


and if the field in the cavity remains essentially unchanged after insertion 
of the sample. In order to apply (A1) to the determination of the tensor 
components x, and « we should attempt to satisfy three conditions: the 
electric field should vanish at the sample, the magnetic field should be 
normal to the static magnetic field, and the relationship between RF 
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magnetization m and RF magnetic field /® in the cavity should be sim- 
ple. All three conditions can be satisfied if a thin ferrite disc is placed 
against the endwall of the cavity (Fig. 3). Noting that the tangential 
component of the magnetic field intensity is continuous at the plane 
face of the dise we have: 
= — jhe 
(A3) 
me = + Xmheo 
Inserting (A3) into (A1) we find the additional magnetic energy stored 
in the dise 
| + + — heh,°*)] dv (A4) 
n order to evaluate (A4) we use the fact that the TEy.-mode can be 
I l luate (A4 the fact that the Tl | I 
expressed as the sum of two circularly polarized modes rotating in 
opposite directions 
= hy’ + hy’ 
(A5) 
EY = Ey + E: 


Thus, we have 


has = (kere Bz 


has = cos Bz 


has = BJ (kere sin Bz 


En. = +B Ji(ker)e* sin Bz 


= jB Ji (kere sin Bz 

amplitude factor 
Bessel function of the first kind 
propagation constant in the z-direction 
propagation constant in the r-direction 
propagation constant in free space 
length of cavity 
wavelength in free space 
radius of cavity 
wavelength in the cavity 
first zero of the derivative of J; 


ety 
) 
(A6) 
; 
2 
= (B fp“)? 
k= 
ve = Pr /a 
\g = 2L 
pi, = 1.841 
| 
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Integration of the electric or magnetic field over the volume of the 
cavity yields the stored energy in the empty cavity at resonance for one 
of the two circularly polarized modes 

= 0.2387 (A8) 


8) 


The magnetic energy W,,” in the dise is found by integrating (A4) 
over the volume of the dise and assuming that the field is constant over 
the thickness ¢ of the disc. We obtain: 

The two functions 2; and R, depend on the ratio of disc radius to cavity 
radius 


R, = 4.1893 ce | + 


; 
a) (Ji(kero)) | (A10) 


= 2.4720(Ji(kero))° (A11) 


It is interesting to note that these two functions are approximately 
equal (Fig. 4) if the dise radius is less than half the cavity radius. In 
this region the field in the cavity is essentially circularly polarized, 
whereas elliptical polarization exists near the wall of the cavity. Inserting 
(A8) and (AQ) into (Al) we find the desired relationship between the 
two frequency shifts associated with positive and negative circular 
polarization and the tensor components x, and x. 

Equations (Al) and (A12) hold for complex x, and « if a complex 

frequency shift is introduced as follows: 

da = w — wa + j(a — ap) (A13) 
The attenuation constant @ may be defined in terms of the internal Q 
of the cavity, a = }w/Q and the internal Q is defined as 


— w(energy stored in circuit) 
average power loss 


Thus, the imaginary part of the frequency shift may be expressed as 
the difference between (1/Q) of the perturbed cavity at the new reso- 
nance frequency w and (1/Qo) referring to the empty cavity at wp . 


(A14) 


We note that the imaginary part of the right hand side of (Al) does 
indeed represent the power dissipation in the perturbing sample over 
the stored energy times w. Hence, taking the imaginary part of (A12) 


is 

A(1/Q) = Q 

: 
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we have a relationship between the change in (1/Q) and the loss terms 
Xm” and x” 
_ 

Clearly, (A15) holds only if the initial Q of the empty cavity is very 
high and the change in Q is quite small. 

The electric susceptibility of the ferrite disc can be obtained in a 
similar way, provided we place the dise at the cavity center where the 
electric field has a maximum. Then, the additional electric energy 
stored in the dise is found to be 


A(1/Q)s. (xm” Ry + x” Re) (A15) 


w,” = 0.2387 tR, (A16) 
~ ve 

It should be noted that there is an important difference between loca- 
tion of a thin dise at the endwalls and at the center of a cylindrical 
cavity. Whereas the electric field at the endwall may be neglected en- 
tirely, the magnetic field at the cavity center has a component parallel 
to the cavity axis. The effect of this component on the stored energy in 
the dise may be minimized by magnetizing the disc beyond saturation in 
the z-direction. 

With the assumption that the effects of the magnetic RF field may be 
neglected we obtain relationships for the electric susceptibility and 
electric loss factor: 


2(Aw/wo) jAC/Q) —= (xe jx.) R, (A17) 


Since x, is a scalar quantity there is no splitting of the cavity resonance 
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This paper discusses some factors which limit the sensitivity of microwave 
paramagnetic resonance equipments. Several specific systems are analyzed 
and the results verified by measuring the signal-to-noise ratio with known 
amounts of a free radical. The two most promising systems, especially at low 
powers, employ either superheterodyne detection or barretter homodyne de- 
tection. A detailed description of a superhetrodyne spectrometer is given. 
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I. INTRODUCTION 


Within the past few years the field of paramagnetic resonance ab- 
sorption has become an important tool in physical and chemical re- 
search. In many ways its usefulness is limited by the sensitivity of the 
experimental set up. A typical example is the study of semiconductors 
in which case one would like to investigate as small a number of impuri- 
ties as possible. It is the purpose of this paper to analyze the sensitivity 
limits of several experimental set ups under different operating condi- 
tions. This was done in the hope that an understanding of these limita- 
tions would put one in a better position to design a high sensitivity 
electron spin resonance equipment. In the last section the performance 
of the different experimental arrangements is tested. The agreement 
obtained with the predicted performance proves the essential validity 
of the analysis. This paper is primarily for experimental physicists con- 
fronted with the problem of setting up a high sensitivity spectrometer. 


Il. GENERAL BACKGROUND 


We will not consider here the detailed theory’ of the resonance phe- 
nomenon but consider this part of the problem only from a phenomeno- 
logical point of view. When a paramagnetic sample is placed into an RF 
field of amplitude H of a frequency w at right angles to which there is a 
de magnetic field Hy) , magnetic dipole transitions will be induced in the 
neighborhood of the resonance condition 

hw = gBHy (1) 
where g is the spectroscopic splitting factor, h is Planck’s constant and 
8 is the Bohr magneton. As a result of these transitions power will be 
absorbed from the microwave field 1, . This power absorption is asso- 
ciated with the imaginary part of the RF susceptibility x”. The trans- 
mitted (or reflected) HH, will also experience a phase shift which is 
associated with the real part of the RF susceptibility x’. The sensitivity 
of the setup is then determined by how small a power absorption (or 
phase shift) one is able to detect when going through a resonance 


Q CHANGES ASSOCIATED WITH THE ABSORPTION 


The average power absorbed per unit volume of a paramagnetic 
sample is 


P= (2) 


Page 
480 
: 
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For low enough powers x”’ is not a function of H, (and even for very high 
powers never drops off faster than 1/H,°), so that for a large power ab- 
sorption one would like a large RF magnetic field. This suggests a re- 
sonant cavity which indeed is used in all experimental setups. The Q of 
a cavity into which a paramagnetic sample is placed is given by 


Energy Stored 


Q = 


Average Power Dissipated 


where P; = power dissipated in the cavity in the absence of any para- 
magnetic losses, V, is the sample volume and V. the cavity volume. 

Assuming that the paramagnetic losses are small in comparison with 
P, we get 


Qo} = dary’ 


AQ = Qo 


where (Qo is the cavity Q in the absence of paramagnetic losses and 7 
is the filling factor and depends on the field distribution in the cavity 
and the sample. For example, in a rectangular cavity excited in the TE, 
mode 


4 


(9) 
V. i+ 
a 


where d is the length of the cavity and a the width along which the / 
field varies. In the above example it was assumed that the sample is 
small in comparison to a wavelength and is placed in the max. //; field. 


IV. COUPLING TO RESONANT CAVITIES FOR MAXIMUM OUTPUT 


Having established the Q changes associated with the resonance 
absorption, we will next determine the proper coupling to the resonant 
cavity in order that the Q changes result in a maximum change in trans- 
mitted or reflected power (or voltage). The derivation will be based on 
the assumption that we have a fixed amount of power available from 
our source and that the Q change is not a function of the RF power (no 
saturation effects). 


\ 
: 
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A. Reflection Cavity 


Fig. 1 shows a magic (hybrid) 7 which serves to observe the reflected 
power from the cavity. Arm 3 has a slide screw tuner which serves to 
balance out some of the power coming from arm 2. This does not affect 
the present analysis and will be considered later in connection with 
detector noise. It should be mentioned, however, that a certain ampli- 
tude or phase unbalance has to be left. This insures that the signal in 
arm 4 will be a function of either x’ or x”’.’ In the case that the magic 7 
is completely balanced out the signal in arm 4 will be a function of both 
x’ and x” and the experimental results become difficult to analyze. 

Fig. 2 shows the equivalent circuit for a reflection cavity.” The /2 
in the source voltage arises from the fact that half the power is lost in 
arm 3. From this equivalent circuit we can define the following relations: 


Unloaded Q = Qo = = (Losses due to cavity alone) (6) 


wh (Losses arising from power 


arn s ( = = . 
Ron? leaking out of the cavity) 


wl (Losses due to both cavity 
( = ( = 
du Ron? + r and leakage out) 


DETECTOR 


-- CAVITY C 
OP, 


Fig. 1 — A simple arrangement to observe the reflected power from cavity C. 


Fig. 2 — Equivalent circuit for a reflection cavity. 
£ 


a 
(7) 
SLIDE 
TUNER 
MAGIC 
: 
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We define the coupling coefficient 8 = Q,/Q» such that: 


Critically coupled cavity B = = = (9) 


Overcoupled cavity | VSWR = 8 = (10) 


Undercoupled cavity — < 1 VSWR = —-= 11 
I YO. (11) 
We will see that the coupling coefficient for maximum output depends 
on the characteristics of the detecting element. Two cases will be treated: 
the power (square law) detector and the voltage (linear) detector. 


1. Detector Output Proportional to Incident Power 


Power into the cavity at resonance 


Vn r 
(Ron? + 
Max. power available from source (in arm 1) 


_ (Vn)? ,  2Ron'r 


Py = 7 «P= 
yn? (Ron? + ry? 


The change in reflected power AP p equals the change in the power inside 
the cavity AP. (since the incident power stays the same). 
dP. Ron’ —r 
AP, = — Ar = 2Ron Py) ——.. Ar (12) 
or (Ron? + 
We want to optimize AP, with respect to the coupling parameter n° 
(or Ron’), i.e., 
a(AP.) 
0(Ryn?) 
Ron’ 
=24 V3 


r 


(Ron’)? — 4(Ron’)r +r=0 
(13) 


the positive sign being associated with the overcoupled, the negative 
with the undercoupled case. The experimentally measured quantity 
is the voltage standing wave ratio VSWR = 2 + 1/3 = 3.74 correspond- 
ing to a reflection coefficient of 0.58. Putting this value into (12) we get 
for the maximum signal 

AP, 


P = (0.193) (14) 
0 


( 
= 40.193" = 0.193 
r Qo 
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the last step being obtained with the aid of (4). Equation (12) is plotted 
in Fig. 4. From the symmetry of the graph it is obvious that for a given 
VSWR, the signal will be the same for the overcoupled and undercoupled 
case. However, as we will see later from the standpoint of noise the 2 
cases are not necessarily identical. 
2. Detector Output Proportional to Input Voltage 
Let T be the reflection coefficient, then Ver: from the cavity is 
a ee V2\VSWR + 1 V2 VSWR + 1 


With the aid of (10) and (11) this gives for the undercoupled case: 


2r 


and the overcoupled case 


V 2R on” 
Veer, = /9 Ron? +r 


We are interested only in the change of output voltage which is: 


OV 4/3 Vir Ron’ 


or (Ron? + r)? 19) 


= 
The two signs corresponding to the undercoupled or overcoupled case, 
respectively. In order to find the optimum coupling 


O(AV) 
(Ron?) 


= Rn’ —r=0 
2 
= 
Putting this value into (15) we get the max. value 


AV V2 Ar 


” 


(15) is again plotted in Fig. 4. From this graph we see that for maximum 
sensitivity we want to work near match. However, one should not work 
so close to match that the absorption signal will carry the cavity through 
the matehing condition while sweeping through a resonance line. This 
would result (due to the sign reversal of the signal at match) in a dis- 
torted line. Incidentally, the sign of the signal may be conveniently used 
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(a) 


L 


Ri=Ro 
DETECTOR 


(b) 
Cn3Ro 


Fig. 3 — Equivalent circuit for a transmission cavity. 


to determine whether the cavity is overcoupled or undercoupled. This 
information may be necessary in Q» determinations.” 
B. Transmission Cavity 

Fig. 3 shows the equivalent circuit for a transmission cavity, the 
generator and the detector being matched to the waveguide, i.e., 


Re Ry Ro 
Analogous to the reflection cavity we define again two coupling coeffi- 
cients 
Rony Ron? 
= 


r r 


By (18) 


the relation between the unloaded and loaded Q being 
Qo = Qi(1 + Bi + 2) 
1. Detector Output Proportional to Input Power 
V'ny ne Ro 
(Rony? + r + Ron?)? 
4(nRo) 


Power into load P, 


Max. power generator can deliver Po 


OP, 2V'n, ny Ro 
ars or (Rony + r + 


v— 
19 
(19) 
: 
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In the case of the transmission cavity we have two coupling coefficients 
whose optimum value we have to determine. 


O(AP;) O(AP,) 


A(nPRo) A(n2Ro) (20) 


9 
= te Re =r 


which means that the input and output coupling should be identical. 
Relation 20 looks superficially like a matching condition. However, it 
should be noted that the input impedance to the cavity contains besides 
the cavity impedance the load impedance. Hence, the VSWR is 

Ron r 

Ron? 

which represents an undercoupled case. One never can overcouple a 
transmission cavity with equal input and output couplings. Putting 
condition (20) into (19) we get: 


> 8 S S 
al 


2. Detector Output Proportional to Input Voltage 


The voltage across the load 


Vnyneko 


Ve = Rnd 


Again for max. sensitivity both couplings should be the same 


or 
OV, 2 r 
0 Ron = (23) 
AV, 1 Ar 1 AQo 
78 ( (24) 
V Sr 8 Qo 8 Tx Qo 


Fig. 4 is a plot of (12), (15), (19), and (22). It should be noted that 
the sensitivities of the reflection cavity are normalized to the input of 


the magic 7 (in Fig. 1) and not to the input of the cavity as in the trans- 
mission cases. This results in a 3-db decrease in output and causes the 


power sensitivity of the transmission cavity to look relatively higher. 


However this is somewhat arbitrary since a balanced transmission type 
scheme would also require a magic 7 with an accompanying reduction 


in usable power. 


: 
3 
| 


MICROWAVE PARAMAGNETIC RESONANCE ABSORPTION 157 


All the previous sensitivity expressions are proportional to x’. For 
an unsaturated condition it may be replaced by x’ whenever the output 
is sensitive to phase changes in the cavity.’ 

It should be noted that we maximized the output from the detector. 
This will result in a maximum signal to noise ratio if the noise is a 
constant independent of the microwave power. This, however, is in 
general not the case and in the next section we will investigate the signal 
to noise ratio taking into account its dependence on the RF power. 


V. MINIMUM DETECTABLE SIGNAL UNDER IDEAL CONDITIONS 


The minimum signal is ultimately determined by the random thermal 
agitation. Due to this cause the power fluctuates by an amount kT7'Ap, 
where k& is Boltzmann’s constant, 7’ the absolute temperature and Ap 
the bandwidth. The minimum detectable microwave power will be then 
of the order of kT’ Av. This is the problem one faces when designing sensi- 
tive microwave receivers. However, our problem is of a different nature. 
We want to detect a small change in the power level of a relatively large 


UNDERCOUPLED OVERCOUPLED 
x(4 Qo) 
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Fig. 4— Output versus V.S.W.R. for reflection and transmission cavity. 
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microwave signal. This change in power level will have to be consider- 
ably larger than kT’ Av before it can be detected.® The physical reason for 
this is that the fluctuating fields associated with the noise power com- 
bine with the microwave fields to produce power fluetuations much 
larger than /7’Av. It is more straightforward to compare noise voltages 
rather than powers, especially since the power changes are not neces- 
sarily a constant of the system. In lig. 1 for instance, the power change 
in arm 4 is 


(AV)(V,) 
Ro 


whereas the power change in arm 2 (for the same voltage change) is 


AP, 
It also shows that one wants to maximize the change in output voltage 
as was done in Section IV. 
The open terminal RMS noise voltage of a system with an internal 
impedance Ry is given by 
Vrms Av 
If we terminate this system with a noiseless resistor Ry, the voltage 
across it will be ~/R)kTAv. However, the terminating resistor is also 
at temperature 7’, so that the total RMS voltage across it will be 
V2 VRokT dy. 
Comparing this RMS noise voltage with the signal voltage obtained 
in (16), we get for the reflection cavity* 
AV = V 24x = V/RokTAv (25) 
++ XMIN \ 
As an example let us consider the following typical value for a 3-cm 
setup. Q = 5 X 10°, Avy = 0.1 eps, Py = 10°° Watts 
4 ~ 
Ve d\? 


a 
For this case 


*In most cases the behaviour of the transmission and reflection cavity is 
similar, so that they will not be treated separately. 


Al 4 
( 
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This corresponds for an unsaturated Lorentz line’ to a static suscepti- 
bility x» = 7/3x’’(Aw/w), where Aw is the line width between inflection 
points. For the free radical diphenyl! picryl hydrazyl having a 2 oersted 
line width this expression at room temperature gives for the min. number 
of spins 10'°. A plot of the minimum RF susceptibility and minimum 
number of electrons versus microwave power is shown in Fig. 5. 


VI. SIGNAL-TO-NOISE 


IN PRACTICAL SYSTEMS 


A. General Considerations 


1. Why Field Modulation? 


From a design point of view it is instructive to consider the mini- 
mum fractional voltage change corresponding to the above Xmin’’V, of 
‘ 14 rm: 
2 xX 10 ~. This turns out to be, see (16), 


From this figure one may safely conclude that it is not feasible to use 
any system in which the microwave carrier level reflected from the 
cavity has to be kept constant to this accuracy. Such systems would 
include straight detection, the de being bucked out and amplified or 
systems employing amplitude modulation of the carrier. (Although 


10710 10'4 
MINIMUM NUMBER OF SPINS 
_-FOR THE CONDITIONS: 
a Qo = 5x103 

Af = | 1013 
= 
Ww 3 a 

1.51 
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“= T 10!2 ° 
a 
> 
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= 
= 
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MICROWAVE POWER FROM KLYSTRON, Po [Ww] 


Fig. 5— Minimum RF susceptibility and number of electrons which should 
be observable under thermal noise limitations. The conditions for the minimum 
number of spins correspond closely to those under which the experimental set-ups 
were tested. 
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the latter system may be improved by microwave bucking, it still re- 
mains very much inferior to the field modulation system to be described 
presently.) A system commonly used in which the requirements on the 
constancy of the microwave level is less stringent, makes use of a small 
external magnetic field modulation of angular frequency wy super- 
imposed on the slowly varying de magnetic field. Thus in the absence of 
a resonance line the output is zero except for some small Fourier com- 
ponents of the random fluctuations at the frequency wy . If the ampli- 
tude of the field modulation AH y is small in comparison to the line 
width A/#/ this method will sweep out the derivative of the line, i.e., the 
signal will not be proportional to x’’ as previously assumed but to 
dx’’/dH (AH y). In order to preserve the line shape one should sweep 
only over a fraction of the line width. The sensitivity will thereby be 
reduced by roughly the same fraction. It should be noted, however, that 
even if one overmodulates the line (in order to increase the sensitivity) 
the resonance condition (i.e., place of zero signal, corresponding to new 
slope in the absorption) will not shift for a symmetrical line and the 
correct g-value may be obtained. Also from the knowledge of the ampli- 
tude of the modulating field the increase in line width may be corrected 
for. l’or those reasons we will not be concerned with the reduction in sen- 
sitivity due to this field modulation scheme. 


2. Choice of Frequency 


Referring to (26) 


1 
a — — 
V./ Qo VPo 


and the minimum total number of electrons Niwin. 


V.\ (dw 1 
(5 w V/P, 


Assuming that we are dealing with the same type of cavity mode at 
different frequencies, the same power, and that the line width Aw is 


1 1 
constant, we have Vex —, Qo « = 
w 


1 


Equation 28 shows that in order to see the smallest number of spins 
we want to go to as high as frequency as possible. The upper limit is 
given by the availability of components in the millimeter region, by the 


difficulty of handling them and by the maximum available power. The 
most commonly used setups operate at a wavelength of 1 em and 3 cm. 


— 
| ; 
27 
a 
: 
Wa 
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The latter was used in the experimental part of this paper. From (28) 
we see that with a 1 em setup the number of observable electrons should 
be approximately 40 times less than with a 3-cm setup. However, in 
most practical cases one is not limited by the amount of available 
sample, since one usually can increase the sample size at longer 
wavelengths. Therefore a better criterion is the minimum number of 


electrons per unit volume 
1 (=) (29) 
J Qo 


Keeping now the filling factor V/V, constant we see that the sensi- 
tivity of a 3 em setup as defined in (29) is only 4/3 worse than of a 1 em 
setup. In addition the power outputs of 3-em klystrons are usually 
sufficiently higher than those of l-em klystron to overcome even the 
V/3 advantage. If RF saturation comes in, the power argument is not 
valid, but one has to consider the RF magnetic field 1, inside the cavity 
which for a given power and Qs» is prop. to w. Thus at the higher fre- 
quencies (1 ¢m) saturation effects become more pronounced reducing 
again the advantage of a l-cm over a 3-cm setup. In deciding the choice 
of the frequency in special cases (e.g., When Aw is a function of the mag- 
netie field, or the sample is larger than a skin depth) (29) should be used. 

There are, of course, considerations, other than those of max. sensi- 
tivity, which have to be taken into account. For example one would 
always like to satisfy the condition Aw/w «< 1 which favors higher fre- 
quencies. On the other hand, for very narrow lines (say less than 0.1 
oersteds) fractional field instabilities and inhomogeneities will favor 
low magnetic fields, i.e., lower frequencies. One also might encounter 
samples which exhibit an excessive loss in a given frequency band which 
therefore has to be avoided. 


3. Optimum Amount of Sample to be Used 


The output voltage AV is proportional to the sample volume and 
the unloaded Qo of the cavity, see (26). If the sample is lossy an increase 
in its size will reduce the Q and therefore reduce the signal. We may 
roughly distinguish two limiting cases. In one case the losses are pro- 
portional to E? (e.g., high resistivity samples having dielectric losses), 
in the other case they are proportional to Hy,’ (e.g., low resistivity sam- 
ples in which the losses are due to surface currents). 


a. Losses Proportional to k~ 


The paramagnetic sample is placed in the region of max. RF magnetic 
field for instance at the end plate of a rectangular cavity resonating in 


: 
i 
o 
if 
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the TE) mode. The sample extends a distance x into the cavity, « being 
assumed to be small in comparison to the wavelength so that //; does 
not vary appreciably over the sample. The additional losses due to the 
sample will then be proportional to 


A Ey sin’ ( ~ 


whereas the volume of the sample is proportional to x. The observed 
voltage change AV is then given by 


AV«Q)V,« 1 


( Y, 3 


where Qp is the Q of the cavity without sample, and Qo’ the total Q of 
both cavity and sample. Maximizing the voltage change AV we get 


1 
2Qo (30) 


Qo = 


Equation (30) tells us that we should load the cavity with the sample 
until the Q, is reduced to 3 of its original value. It should be pointed out 
that we optimized the signal and not the more important quantity, the 
signal-to-noise ratio. Therefore the analysis is only valid as long as the 
noise is not a function of Q», (see Section VIB) and that we do not 
saturate the sample (see Section VIII). If either condition does not 
hold the amount of sample to be put in should exceed the above calcu- 
lated value. 


b. Losses Proportional to Hy’ 


The losses do not vary along the sample so that we may write 


AV «Qo V,« 


which clearly has no maximum for 2. One should therefore put as big a 
sample into the cavity as possible (compatible with the assumption that 
if it be small in comparison to a wavelength), the same result as if one 
had no losses at all. 


B. Noise Due to Frequency Instabilities 


Before considering the signal-to-noise ratio for specific systems we 
will investigate a noise source which is common to all of them. It arises 


(x) 
; 
) 
1 (a 
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from the random frequency variations of the microwave source or from 
the random variations of the resonant frequency of the cavity (e.g., 
rising helium bubbles at 4°K or just any microphonics). 

From the equivalent circuit of a reflection cavity (see Fig. 2) we can 
write for the voltage standing wave ratio 


Zin j ) 
Ro ~ Ron? + Ron? (ut wl 


r 


where 6 = Qpo(2Aw/w) and Aw is the frequency deviation from the 
resonant frequency of the cavity. The reflection coefficient T is then 
given by: 


+ 76) +1 


Ea 


where I'p is the reflection coefficient at the resonant frequency of the 
cavity. The other two terms giving the changes in IT for a given fre- 
quency deviation Aw. The changes of Aw (or AT) having ac components 
near the modulation frequency will thus represent noise terms which 
will pass together with the signal through the detection system. The 
slide screw tuner (see Fig. 1) which is used to buck out part of the micro- 
wave power will introduce an additional reflection coefficient Ty + jI',’. 
Thus the total reflection coefficient will be given by 


We are interested only in the magnitude of V (i.e., | |) reaching the 
detector. Tuning to the dispersion mode (x’) the slide screw tuner is 
adjusted such that Ty’ > Io — I'py. Under these conditions the output 
noise voltage will be given by 


(32) 


Tuning to the absorption (x’’), the condition Ty — I'py > IT’, will be 


r on 2 
7 
| 
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satisfied and (31) becomes 


xl on? +1) + 1) r.) 


An inspection of (31), (82), and (33) shows that the noise voltage, enters 
as a first order effect in 6 when tuned to x’. This is not too surprising 
since a frequency effect is expected to affect predominantly the disper- 
sion mode. When tuned to x” the effect becomes second order as long 
as | Io — I'p | is large. Under those conditions the 2 terms in (33) are of 
comparable magnitude. We can easily see the origin of the second term. 
It arises from the first order out-of-phase component of the noise volt- 
age. Being, however, sensitive only to in-phase components it will be 
reduced to a second order effect—as long as | Ty — T’z | is large, i.e., as 


(33) 


Jong as we have a carrier which makes us insensitive to out-of-phase com- 
ponents.* When I) — Tz goes to zero (33) ceases to hold and the noise 
voltage will be given by (32). 

There are two important conclusions to be drawn from (33). 

We want to keep Ty — I'y as large as possible. Therefore in schemes 
(like the superheterodyne see section VI E) where this is not feasible, 
special care has to be taken to eliminate this noise source. 

From (15) we find that the desired signal is proportional to 


r 


Comparing this expression with (33) we see that the signal-to-noise 

ratio may be improved by increasing Ron’/r, i.e., overcoupling the 
cavity until this noise source does not contribute any more. A compari- 
son of (15) with (32) shows that overcoupling will not improve the 
signal-to-noise ratio when tuned to x’. In this connection it should be 
pointed out that only those frequency stabilization schemes can alleviate 
the problem of frequency instabilities whose response time is at least of 
the order to the inverse modulation frequency since the troublesome noise 
components are at this frequency. Some stabilization schemes make use 
of the cavity into which the sample is placed as the stabilizing element. 
Although this system may be excellent for the observation of x’ (it is 
the only one which can compensate for cavity microphonic), it fails in 
the case of x’. The reason is that x’ makes itself observable essentially by 
a frequency shift which in this scheme would be compensated for. 


* For a similar reason one cannot avoid an admixture of dispersion to an ab- 
sorption signal, when investigating a saturated sample in which x’maz > x” maz- 
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C. Noise Due to Cavity Vibrations 


A noise source which can be very troublesome at high modulation 
fields arises from the currents induced in the walls of the cavity from the 
modulating field. The interaction of these currents with the de magnetic 
field causes mechanical vibrations of the cavity walls. This produces a 
signal when tuned to the dispersion, but to first order should give no 
signal when one is tuned to the cavity and sensitive to the absorption. 
However, any detuning will result in a signal, which, having the right 
frequency will pass through the narrow band amplifier and lock-in 
detector. Since this signal is proportional to the magnetic field, it will 
result in a background signal whose amplitude will vary as the magnetic 
field is being swept and thus causing a continuous shift in the base line. 
In a rectangular cavity this effect can be greatly reduced by a proper 
orientation of the cavity with respect tothede magnetic field. This is due 
to the fact that by squeezing the broad face of a rectangular cavity 
(TE mode) the frequency decreases, whereas by squeezing the narrow 
walls of the cavity the frequency increases. Thus in a proper orientation 
the two effects cancel each other out. We found another way of greatly 
reducing the effect by using a glass cavity having a silver coating* thick 
in comparison to a microwave skin depth but small in comparison to the 
modulation frequency skin depth, thereby decreasing the eddy currents 
without impairing the mechanical strength of the cavity. 


D. Klystron Noise 


There is very little data available on presently used klystrons. The 
data quoted by Hamilton, et al* are on a 723A klystron. With an IF of 
30 me, bandwidth of 2.5-me microwave output of 50 mw they obtained 
a noise power of 5 X 10°” watts. Expressing their results in terms of a 
noise figure N;, such that the noise power output in the two side bands 
P,, is given by 

1/Pi\ Po 
P, = 2N.(kT Av) ee Ni: = 5 kT Ap sl 0 34) 
Substituting their numerical values one obtains for s = 5000 Watt ™?. 
The values for s that we obtained with a 60 me IF are: 


Higher mode of V-153 klystron s ~ 1,000 Watt™ 
Lower mode of V-153 klystron s ~ 3,000 Watt” 
Higher mode of X-13 klystron s~ 200 Watt” 
Lower mode of X-13 klystron s~ 400 Watt” 


* We are indebted to A. V. Hollenberg and V. J. DeLucea for the making of 
the glass cavities and to A. W. Treptow for the excellent silver coatings. 
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The method used to determine the above noise figures is similar to the 
one described in Reference 4. The figures are expected to be several times 
larger when a 30 me IF is used. (A factor of 2 is quoted by Hamilton, 
et al.*) It is also worth noting that the relative noise power decreases on 
going to higher modes. 


FE. Signal-to-Noise Ratio for Specific Systems 


In this section we will analyze specific systems under varying condi- 
tions. The reason why we do not present the analysis of one ‘““The Best”’ 
system is that sometimes a compromise between complexity and sensi- 
tivity has to be reached and also because some systems may be superior 
at high power whereas others at low powers. 

The expression for the noise power Py at the output of the microwave 
detector (X-tal or bolometer) is’ 


Py => (GNr + F amp. + t—- L)(kT Av) (35) 
where: 
G = conversion gain of the detector (generally smaller than one. A 
quantity often used instead of G is the conversion loss L = 1/G@),. 
Nx = noise figure at the input of the detector. Usually due to random 


amplitude or frequency fluctuations of the microwave source or the mi- 
crowave components (see Section VID). 


Fampt = noise figure of the amplifier 
ft = noise temperature of the detector 
Comparing the equivalent voltage fluctuations of this noise power with 


the signal voltage as derived in (26), we get for the minimum detectable 


4? 


x 


Xmin (36) 


Fame +t — 


The above relation should apply to all systems. The problem then 
reduces to the determination of G, N;, Fame, and ¢ for the particular 
detection scheme. A difficulty arises from the fact that not only are those 
quantities a function of the RF power and modulation frequency but 
in the case of detectors vary from unit to unit. It is probably for this 
reason that the values quoted in the literature are sparse and are not 
in agreement with each other. The values used in this analysis for the 
X-band barretters (821) and erystals (1IN23C) were obtained by us. 
Values for K-band crystals can be found in References 6 and 7. It 
should also be borne in mind that the values are time dependent and 
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will have to be modified as the “art” of detector manufacturing im- 
proves. Also new systems might come into prominence in the near 
future. An example would be the use of low noise travelling wave tubes 
preceding the detector or even low noise solid state masers. 


1. Barretter (bolometer) detection. 
The resistance of a barretter is given by the relation” 
R = Ro + kP" (37) 


lor practical purposes n may be taken as unity. The instantaneous power 
input to the barretter for a modulated microwave is given by 


P= sin Qt (1 + cid sin + (38) 
R 0 

where Vo is the amplitude of the microwaves, AV the change of the am- 

plitude due to the absorption given by (16), Q the microwave fre- 

quency, w the field modulation frequency, and Va. the bias on the 

barretter. Expanding (38) and assuming that AV/V) < 1 we get, after 

throwing out the high frequency terms, 


R 
where Pre is the power in the unmodulated carrier reaching the bar- 
retter. It is of course smaller than Py the microwave power from the 
klystron because of the power splitting in the magic 7 and the reflection 
from the cavity. Taking a reflection coefficient T ~ 0.5 (see Section IVA), 
Prer/Py ~ 0.1. The desired voltage fluctuation associated with the re- 
sistance change is: 6V = /,AR, where 
dR dR 


AR = AP = (AP ee + 


R = Ro + k (Par + Pi “+ oAV sin at) (39) 


aR 
dt 


I) is the current bias on the barretter which we want to keep constant 
for a maximum voltage change 6V. 
The power gain of this device G is given by 
Signal Power from Barretter _ Tok’ 


PacPrr 


G = 
(41) 


Teh) 


| 
To 7p 
2k) 
4 
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The noise figure of the audio amplifier following the detector is given in 
general by: 
Requ + Re 


Famp = Ra (42) 
where Ff, is the generator input resistance in this case the barretter 
resistance and R.quy is an equivalent noise resistor in series with the 
generators. The best input tube that we found was the General. Electric 
GL 6072 triode* for which we measured an R equ at 100 ¢.p.s. of ~10° Q. 
(This is due to flicker noise of the tube and has a 1/f dependence.) If 
we were to connect the barretter, having a resistance of ~200 © straight 
to the grid of the input tube we would get a noise figure of ~500. How- 
Requ 
Re 
noise figure of the amplifier can be reduced to nearly unity. We will as- 
sume in the following analysis that this has been done. 

The noise temperature of the barretter ¢, was thought to be approxi- 
mately 2 since it is merely a platinum wire operating at an elevated 
temperature. To our surprise the measured value turned out to vary 
for different units between 4 and 40.7 The noise figure was measured 
on about 20 different units obtained from 4 different manufacturers 
(P.R.D.; F.X.R. Narda, Sperry). The reason for this noise is not en- 
tirely clear at present. A possible explanation is the non-uniform heat- 
ing of the wire which could set up air currents. They in turn can cool 
the wire in a random fashion giving rise to an additional noise com- 
ponent. An improvement of the noise figure was noted upon evacuating 
the barretter. The noise figure of a unit which was initially 10, dropped 
to the expected value of 2 after evacuation. However, it should be 
pointed out that this cannot be taken as a definite proof for the “air 
current theory” since the characteristics of the barretter changed 
markedly after evacuation. The sensitivity of the evacuated barretter 
went up from 5Q/mW to 2002/mW which necessitated a reduction of the 
de current from 8 to 1.5 mA. Also the response time went up by a factor 
of 20, so that the effectiveness of any noise mechanism with a 1/f spec- 
trum would be greatly reduced. This approach however looks definitely 
promising in trying to design more sensitive and less noisy barretters. 
In the present work commercial unevacuated barretters were used, their 
noise temperature being taken as 4 in the following analysis. Under 


ever, by using a step-up transformer with a turns ratio n > the 


* We are indebted to R. G. Shulman for bringing this tube to our attention. 
7 One unit which exhibited an extremely large noise figure of 1,000 was elim- 
inated entirely. The solder point of the platinum wire was apparently defective. 
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those assumptions (36) becomes: 


XMIN = ther G 


a. Straight detection 


A block diagram of the microwave part of a simple barretter system 
is shown in Fig. 6. The attenuator serves the purpose of preventing power 
saturation of the sample or burn out of the bolometer at high powers. 
By means of the slide screw tuner and magic 7 arrangement one makes 
the system sensitive to either the real or imaginary part of the suscepti- 
bility. 

The characteristics of a typical barretter (like the Sperry No. 821) 
are: R = 2500; k = 4.5 0/mW; Pyax = 32 mW. We take the worst 
generator noise figure reported, i.e., Vy = 5,000 Pre (see Section VID). 

The ratio of the minimum susceptibility xmin-ops that can be de- 
tected with this system to the minimum theoretical value if one were 
limited by thermal noise only becomes with the aid of (41) and (43) 


PrrPac 


Gr (44) 


XMIN-TH 
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Fig. 6 — Essential microwave parts of a simple barretter or crystal set-up. 
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Fig. 7 — The ratio of the minimum detectable susceptibility to the minimum 
theoretical value versus microwave power for 2 different barretter schemes. 
Full lines correspond to the predicted sensitivity and dots indicate experimental 
values. 


Equation (44) is plotted in Fig. 7. From this plot we see that the system 
is extremely poor at low powers (which is due to the low conversion 
gain of barretters) and also starts getting worse at high powers (due to 
the signal generator noise). The latter point is not of great importance 
since one can always buck down the microwave power by means of the 
slide screw tuner to the desired level. By using the evacuated barretter 
as mentioned earlier, the curve in Fig. 7 would be shifted to the left 
corresponding to the increased conversion gain. 


b. Balanced mixer detection 


An improved barretter scheme is shown in Fig. 8. It eliminates the 
poor conversion gain at low powers by employing a balanced mixer 
into which a large amount of microwave power P2 can be fed from the 
same signal generator. Since the barretter noise should not be power 


= 
| 
| 
5 


MICROWAVE PARAMAGNETIC RESONANCE ABSORPTION 471 


dependent (unlike in crystals) this procedure improves the conversion 
gain without increasing the noise. Since a balanced mixer is used the 
noise from the signal generator is also cancelled. A necessary precaution 
in this set-up is to include extra isolation between the second magic B 
and the mixer in order to prevent any microwave power from leaking 
through the balanced mixer into the cavity. 

For this arrangement (44) becomes: 


XMIN-OBs (45) 
XMIN-TH 
The factor of +/2 arises from the fact that we had to split the power 
Po in the first magic 7’. Since in this scheme we are at liberty to vary 
the input power to the barretter we want to maximize G with respect 
to P,. For a fixed total power to the barretter given by its burn-out 
ratings (i.e., P2 + Pa = constant) (41) is a maximum for P; ~ Pace & 
Pyrax 


= Taking again the data for the No. 821 barretter we get for Gyax 


~ 0.5 and for 
~4 (46) 
XMIN-TH 


Since the value of /2 can be held constant irrespective of the power 
in the cavity, this ratio will be a constant (see Fig. 7). 

It should be pointed out that in this system a wrong phasing of arm 
P, will result not only in a reduetion of the signal, but also in an ad- 
mixture of x’ and x’’. Therefore after changing the power by means of a 
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Fig. 8 — Barretter system with balanced mixer. 
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variable flap* attenuator (which also introduces a phase shift) or after 
changing the slide screw tuner the system has to be rephased again. This 
makes saturation measurements less convenient than in the superhetero- 
dyne system to be discussed in the next section. The obvious advantage 
of the homodyne detection scheme is that it requires only one micro- 
wave oscillator. 


2. Crystal detection 


A simple set-up is shown in Fig. 6. Although its microwave components 
are identical to the ones used in the barretter scheme, the analysis of 
this set-up is more complicated. The reason is that not only do crystal 
characteristics vary greatly from unit to unit but they cannot be de- 
scribed by one simple relation over the entire range of incident micro- 
wave power. One can roughly divide their characteristics into a square 
law region where the rectified current J is proportional to Py (holds for 
P, < 10° Watts) and the linear region where J is proportional to 
VP, . (holds for Py) > 10°-* Watts). The output noise of a crystal can be 
represented in general by the relation.’ *'’ 

Py = + 1) kT Av (47) 
where f is the frequency around which the bandwith Av is centered. This 
relation reduces for the square law region to: 


1) kT AS (48) 


and for the linear region to 


The average values of 8 we determined experimentally are: 
B=5 X 10" Watt’ sec ' 
=~ 10" Watt! sec” 
The conversion gain G of the crystal can be represented by 
G = SPur (50) 
in the square law region and by 
G = constant = C (51) 
* The phase shift associated with the Hewlett-Packard X-382-A attenuator is 
quite small. 


+ Values of a for K-band crystals are quoted in References 6 and 7. They differ 
however from each other by approximately 3 orders of magnitude. 
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in the linear region. Values of S and C for the 1N23C were found to be 
S ~ 500 Watt” and C ~ 0.3. 


a. Simple straight detection 


If one does not make use of the bucking possibilities of the magie 7’ 
(i.e., eliminate the slide screw turner in Fig. 6) one has the simplest 
possible set-up sensitive to x’’. Under those conditions the microwave 
power reaching the crystal will be identical to the reflected power from 
the cavity. Equation (36) becomes: 


XMIN-OBS = es + Fame +i- 


G 
XMIN-TH 7 


(52) 


With the aid of (48), (49), (50), and (51), this relation reduces for the 

IN23C in the square law region to: 

_ (1 +5 X 


XMIN-TH 


(53) 


and for the linear region to: 


(3 10 Pi) (54) 
XMIN-TH 

A plot of (53) and (54) is shown in Fig. 9. As before the assumption was 
made that Prr/P> ~ 0.1 (see barretter case). The noise figure of the 
amplifier yp was taken as unity which again can be closely ap- 
proached by means of a step-up transformer. The field modulation 
frequency was assumed to be 1,000 ¢.p.sec., although (47) shows that 
from a point of view of noise one would like to go to as high a frequency 
as possible. However practical consideration such as power require- 
ments for getting a given modulation field, pick-up problems, skin 
depth losses in the cavity wall usually set an upper limit. The modula- 
tion frequency may be also dictated at times by the relaxation times 
of the investigated sample.” 


b. Straight detection with optimum microwave bucking 

From Fig. 9, we see that the straight crystal detection scheme suffers 
at low powers because of the poor conversion gain of the crystal and at 
high powers because of excess crystal noise. This situation ean be 
greatly improved by adding some microwave power to the erystal when 
the reflected power from the cavity is low (to be referred to as positive 
bucking) or subtracting some of the power in the other case (negative 
bucking). In this section we will find the improvement over the unbucked 
system and the amount of bucking required to effect it. 
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wk 
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Fig. 9 — The ratio of the minimum observable susceptibility to the minimum 
theoretical value versus microwave power for different crystal detection schemes. 
Full lines correspond to the predicted sensitivity and dots indicate experimental 
values. 


We define the bucking parameter B by the relation 
P, = BP xp (55) 


Where Pr is the microwave power at the crystal before and P, after 
the bucking is applied. We further assume that after the bucking is ap- 
plied the crystals will operate in the square low region. Combining (49), 
(50), and (52) and neglecting the term GN, which is small in compari- 
son to the other term we get for the bucking scheme: 


F BB’Prr 
XMIN-OBS = (56) 


SBP re 


XMIN-TH 
In order to find the optimum bucking parameter, we set 


d Gs OBS 


oR = () which results in 
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Putting in the numerical values for the 1N23C as quoted previously 
we get that B = 1.4 X 10°-°/Pre . Equation (56) becomes with the opti- 
mum bucking parameter 


XMIN-TH 


XMIN-OBS (58) 


For the case under discussion this ratio turns out to be ~50 independent 
of Py . Fig. 9 shows a plot of (58). The values in parenthesis indicate the 
degree of bucking necessary to accomplish this ratio as determined from 
(57). It should be noted that the negative bucking can be easily accom- 
plished by means of the slide screw tuner in the magic 7’ arm (see Fig. 6) 
whereas for large positive buckings a scheme like in Fig. 8 has to be 
used. (Some positive bucking can of course be also accomplished by 
means of the slide screw tuner). 


c. The superheterodyne scheme 


The RF bucking system just described bears a certain resemblance 
to the balanced mixer barretter scheme. In both cases additional micro- 
wave power was added to the detector in order to increase the con- 
version gain. However in the crystal scheme this resulted in an increase 
in noise power whereas this should not be the case with barretters. The 
question arises whether a decent conversion gain in crystals has to be 
always accompanied by a large noise power. An inspection of (49) 
shows that around frequencies of tens of megacycles* or higher the 
noise output of the crystal becomes negligible. As pointed out earlier 
such high magnetic field modulation frequencies are not feasible. How- 
ever in a superheterodyne system the crystal outputs will be at an 
intermediate frequency of 30 or 60 me. This will make the flicker noise 
components negligible even at high powers where the conversion gain 
is good. The conventional way to obtain the intermediate frequency is 
to beat the reflected signal from the cavity with a local oscillator (see 
Fig. 10) which is removed from the signal generator by the I.F. fre- 
quency. In order to eliminate the noise from the local oscillator a bal- 
anced mixer should be employed. The ratio 


7 

MIN-OBS -,(Fi+t—-1 

becomes then from equ. 52 (59) 
XMIN-TH 

rhe expression in the brackets is called in radar work” the overall 


* It was shown by G. R. Nicoll® that this equation holds up to this frequeney 
range. 
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noise figure of the receiver F. We found that a noise figure of about 
11-14 db is easily attainable with commercial I.F. amplifiers and 
balanced mixer. This would give us a ratio of 


X MIN-OBS 


which is plotted together with the other crystal schemes in Fig. 9. Al- 
though this system does necessitate 2 stable microwave sources, it is not 
difficult to operate once they are set-up. This was not considered as a 
major disadvantage at least not at X-band. The phasing problem dis- 
cussed in connection with the mixer barretter scheme of comparable 
sensitivity is eliminated. An additional small advantage is the rugged- 
ness of crystals in comparison to barretters and the availability of good 
commercial balanced crystal mixers. There are other double frequency 
schemes which do not need 2 separate microwave signal generators. 
The other frequency may be obtained by amplitude or phase modulating 
one signal generator by an IF frequency. The side bands which are pro- 
duced in this way are displaced by just the IF frequency and may be 
utilized instead of the second signal generator. Schemes of this sort 
look particularly promising for frequencies well above X-band in which 
case it might prove difficult to maintain the difference frequency of two 
separate microwave generators within the band width of the IF. 


F. Experimental Determination of Sensitivity Limits 
1. Preparation of samples 


In order to get an experimental check on the previous analysis, 
samples with a known number of spins had to be prepared. Two sets of 
samples were made. One consisted of single CuSO,-5H2O crystals of 
varying sizes hermetically sealed between 2 sheets of polyethylene. The 
other set consisted of different amounts of diphenyl picryl hydrazyl* 
which were similarly sealed up. D.P.H. samples having less than 10" 
spins were prepared by dissolving known amounts of the free radical 
in benzene and putting a drop of this solution on the polyethylene. 
After the benzene had evaporated, it was sealed up with another sheet 
of polyethylene. The g-values of CuSO,-5H,O and D.P.H. differ enough 
so that both samples can be conveniently run simultaneously. This 
was done in order to check the self consistency of the two sets of samples. 
The measured integrated susceptibility of all the D.P.H. samples 


* We are indebted to A. N. Holden for supplying us with this material. 
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with more than 10° spins agreed within a few percent with the caleu- 
lated value. The calculated value being based on the known amount 
of D.P.H. and the measured value being referred to the known amount 
of CuSO,-5H,0. D.P.H. samples with less than 10°° spins had all a 
smaller number of effective spins than calculated. The discrepancy was 
more pronounced the smaller the sample. There was also evidence that 
the smaller D.P.H. samples deteriorated with time. As a typical example 
we quote a sample which started out as 10” effective spins and was 
reduced after 4 weeks to 4 X 10" effective spins and another one which 
initially had 10" spins, deteriorated in the same time interval to 10” 
spins. Since only the smaller samples were noticeably affected, this 
deterioration seems to be associated with a surface reaction. It was also 
observed that the line width between inflection points of the D.P.H. 
samples with less than 10° spins increased from 1.8 oersteds to 2.7 
oersteds. This broadening probably arises from a reduction in the ex- 
change narrowing mechanism due to the spreading out of the sample. 


2. Comparison of experimental results with theory 


In checking the sensitivity of the equipment D.P.H. samples were 
used and the signal to noise was estimated from the recorded output. 
The experimental points thus obtained are shown in Fig. 7 and Fig. 9. 


We believe that the results are significant to within a factor of 2, the 
main error arising from the estimate of the RMS noise. The band width 
of the lock-in detector was Av = 0.03 see', Q) = 4,000, and the field 
modulation used was 3 oersteds p.t.p., 100 ¢.p.sec. for the barretter 


schemes and 1,000 ¢.p.sec. for the crystal schemes. This large modula- 
tion field somewhat distorts the line, but, as mentioned earlier was done 
in order to get the full signal. The D.P.H. samples were calibrated 
against CuSO,4-5H.O before each run. Even so it was not felt safe to 
use samples which had less than 10 spins. 

Referring to Fig. 8 we see that for the straight barretter detector the 
experimental points agree fairly well with the predicted value, but in the 
balanced mixer scheme fall short by about a factor of 4. A possible ex- 
planation of this discrepancy is that the barretters were not completely 
matched in which case the noise from the local oscillator would not be 
compensated for. 

Fig. 9 shows the experimental points for the erystal schemes. For 
powers between 10°‘ W and 10° W the system used fell between the 
simple straight detection scheme and the one utilizing optimum RF 
bucking. The reason is that it was very easy to obtain a certain amount 
of positive bucking (+9 db) by merely adjusting one arm of the magic 7’. 
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It would however have been a great deal more difficult to obtain the 
entire bucking of +22 db at 10°‘ W since a set-up like in Fig. 8 would 
have to be used. Thus for the sake of simplicity the extra factor in signal 
to noise of 2 or 3 was abandoned. The amount of negative bucking at 
the higher powers will be limited by the stability of the bridge. A prac- 
tical limit of (40-50) db was characteristic of our set-up. We see from 
Fig. 9 that the agreement between the experimentally determined sensi- 
tivity and the theoretically predicted sensitivity is satisfactory. 

The experimental results on the superheterodyne scheme agrees 
again very well with the predicted values up to a power level of 10-* W. 
(This corresponds to less than 10” spins in D.P.H.) Above this level 7; 
(see fig. 10) has to be balanced to better than 40 db to keep the IF carrier 
amplitude within the required value. Instabilities in the bridge due to 
mechanical vibrations and thermal drifts start to contribute to the 
noise. Thus at high power levels the superhet scheme starts to loose 
some of its advantages unless special precautions are being taken to 
eliminate the above mentioned noise factors. A great deal in this direc- 
tion could probably be accomplished by shock-mounting the micro- 
wave components and better temperature stability for slow drifts. 
Since we were mainly interested in powers below 1 ml, our efforts were 
limited to controlling the temperature of the room to +1°C. 

Since the superhet scheme was found to be the most sensitive one, 
it might be worthwhile to discuss it in more detail. A block diagram of the 
set up is shown in Fig. 10. 

The signal generator feeds into the magic 7’, where its power is split 
between arm 2 and 3. Arm 2 has the reflection cavity with the sample, 
the reflected voltage being bucked out with the aid of arm 3. For this 
purpose arm 3 has a phase shifter and attenuator, an arrangement 
which was found to be more satisfactory than a slide screw tuner as far 
as stability and ease of operation goes. The desired signal appears then 
in arm 4. It is fed into a balanced mixer which receives the local oscil- 
lator power from the stabilized klystron II. The output of the balanced 
mixer is then fed through the IF amplifier, detector, audio amplifier and 
lock-in detector. The circuits of each of those components is fairly 
standard and will not be dwelled upon further. The microwave power is 
measured in arm 2 of the magic 7’. The power reflected from the cavity 
is also monitored in arm 2. This is of great help in finding the cavity 
when klystron I is swept in frequency by means of a sawtooth voltage on 
its reflector. Since the klystron mode itself might have some dips in it, 
(which might be mistaken for the cavity), it proved helpful to display on 
the scope the klystron mode simultaneously with the reflected power 
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from the cavity. This also provides a convenient way to measure the Qo 
of the cavity.’” The frequency is measured roughly by means of a cavity 
frequency meter and more precisely by means of a transfer oscillator and 
high speed counter. The magnetic field is measured by means of a nuclear 
magnetic resonance set-up, its frequency being measured on the same 
counter as the microwave frequency. The nuclear resonance signal is 
recorded on the same trace as the electron resonance signal. Thus if the 
magnetic field is homogeneous enough, the nuclear sample will see the 
same field as the electronic sample and g-values can be conveniently 
determined to the accuracy of the nuclear moment (this also assumes 
that the signal is large enough, so that no additional error is introduced 
in determining the exact location of the resonance.) The field modula- 
tion coils are mounted on the pole faces and are energized by a 50-watt 
power amplifier. A field of 50 oersteds p.t.p. is available at 1,000 eps and 
a slightly higher field at 100 eps. 

The magnet is a Verian 12” modified so that it can rotate around an 
axis perpendicular to Ho. This was done mainly in order to make 
anistropy measurements more convenient. This enables one to make 
quick saturation measurements in isotropic materials without having to 
change the incident REF power. This is accomplished by rotating the 
magnetic field and measuring the signal strength versus angle. Since only 
the RF field perpendicular to the de field causes transitions, the signal 
in an unsaturated isotropic sample should go as cos? 6; where @ is the 
angle between H, and Ho . From the deviation from this dependence, the 
saturation parameter can be found. This could also be done by rotating 
the cavity, but at microwaves is not as easy as rotating the field. 


VII. A NOTE ON THE EFFECTIVE BANDWIDTH 


There seems to be some confusion as to how jarrow one should make 
an audio amplifier preceding a phase sensitive detector (lock-in) or why 
the band width of the IF amplifier doesn’t enter in a superhet scheme. 
Those and similar questions have to do with the effective band width 
of the system Av which appears in (26). Since similar questions have 
been rigorously analyzed by other authors,': the present discussion 
will try to stress some of the physical ideas underlying the different 
detection schemes. 

We consider first the simple scheme illustrated in Fig. 11. It consists 
of an amplifier with band width Ay, centered around » followed by a 
phase sensitive detector with a reference voltage at V,. The output of 
the phase sensitive detector has an RC filter of band width Avs. One 
‘an see that in such a system the only noise components centered around 


gy 
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v, (this being also the reference frequency) in a band width Ave will con- 
tribute to the output noise. This is because the beat between 2 noise 
components like v, and vy, (see Fig. 11) is too far removed from », to 
produce an output voltage. (This statement implies the condition that 
Avy, < » otherwise the beat between 2», and », could come through.) 
Thus in this system the band width of the amplifier is immaterial as long 
as the noise voltages are not so large as to saturate it. 

A more serious situation may arise in the absence of a reference volt- 
age. In this case the noise components within the band width A», can 
beat with each other and produce a noise output which would increase 
with the band width. This could become especially detrimental in a 
superheterodyne scheme in which the IF band width can be a million 
times larger than the output band width. It can be shown, however, that 
if the carrier voltage V. at the output of the IF is large enough the IF 
bandwidth AF; does not enter into the noise consideration” the eri- 
terion essentially is that 


> G2kTZAF 1p (60) 


where G is the IF amplifier gain, and Z the input impedance. Condition 
(60) means that we want the noise which beats with the carrier to be 
greater than the beat between 2 noise terms. Since the former is propor- 
tional to the carrier, its predominance can be easily ascertained experi- 
mentally by increasing the IF carrier and noting whether the noise 
output increases proportionally. If it does, (60) is fulfilled. 
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Fig. 11 — Effective band width of a phase sensitive detector Avesp = Ave. 
Note that the band width of the amplifier does not enter as long as Av; <  . 
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In order to see what maximum gain G (60) imposes on a typical sys- 
tem we assume AFyp = 5 X 10° c.p.sec. Z = 10°; Ve ~ 1V. Under 
those conditions we get from (60) that G has to be smaller than approxi- 
mately 10°. If on the other hand G is very small the signal level at the 
audio amplifier input is so low that the flicker noise of the detector can 
still come in. A good practical figure for the IF amplifier gain is around 
60 db. 


VIIT. SATURATION EFFECTS 


In all the previous considerations RF power saturation effects were 
neglected, i.e., we have assumed that the power absorbed is proportional 
to H,’, where H, is the RF magnetic field. When this assumption is no 
longer satisfied, the question of sensitivity has to be re-examined for 
different degrees of saturation. However it is difficult from an experi- 
mental point of view to change the conditions of the experiment for each 
degree of saturation and therefore an elaborate analysis of this case 
does not seem to be warranted. However it might be of interest to see 
the effect on the in phase component of the signal at complete or nearly 
complete saturation 

The change in output voltage for a reflection cavity is (15) 


Ron’ 
= 
AV = V2) (Ron? + 


and from (4) 


A AG 
r 


The RF magnetic field in the cavity is given by 
HY = — T")Pin (71) 


where C is a constant dependent on the geometry of the cavity the re- 

flection coefficient. Assuming a simple homogeneous saturation be- 

haviour we substitute for the saturated value of x,’” 


ad Xu Xu (72 


where x.’ is the unsaturated value of the susceptibility and P;, the 
power from the microwave source. 


Ar Xu 


and the output voltage AV. 
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Ron 2 x ” 
AV = V/2 Voz 
(Ryn? + + — T?)P in 
For a high degree of saturation 
T1T2CQ(1 — > 1 


and substituting for 


we get: 


AV = V2 Vorn eg 


75) 


The above relation shows that under saturated conditions the Q of the 
‘avity does not enter and one might as well not use one or use a very 
much overcoupled cavity. This is one of the reasons why in microwave 

gas spectroscopy,* where lines are easier saturated a cavity is not used. 
(The more important reason is that in most cases one sweeps the fre- 
quency of the source, so that a cavity is difficult to use.) 

Equation (75) also shows that the signal and also signal to noise goes 
down with increasing RI power. The above argument does not hold for 
the out-of-phase (dispersion) signal, in particular it breaks down com- 
pletely for signals observed under fast adiabatic passage conditions.’ 
Yor the latter case one wants as high an RF field as possible. 
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The Determination of Pressure Coefficients 
of Capacitance for Certain Geometries 


By D. W. MeCALL 


(Manuscript received February 15, 1955) 


Expressions are derived for the pressure coefficients of capacitance of 
parallel plate capacitors subjected to one-dimensional and hydrostatic 
pressures and of cylindrical capacitors subjected to radial compression. The 
derivations apply to systems in which the dielectrics are isotropic, elastic 
solids. 


I. INTRODUCTION 


The electrical capacitance between two conductors separated by a 
dielectric is a quantity which can be calculated with ease only in certain 
geometrical arrangements of high symmetry. Even the classic example 
of parallel plates presents major difficulties as one may only perform the 
calculation exactly for the case of plates of infinite area or vanishing 
separation. The approximation becomes poor when (are: )} (separation) 
becomes small and the theoretical treatment of edge effects is sufficiently 
difficult that it has not been solved though the solution would greatly 
facilitate dielectric constant measurement. 

When pressure enters into the situation as a variable the difficulties 
are enhanced as one must be able to describe the geometry effects as 
well as the change in dielectric constant. 

The engineers responsible for designing submarine cables are con- 
fronted with the necessity of knowing the manner in which capacitance 
depends upon pressure as may be illustrated in the following way. A 
submarine telephone cable is composed of a central copper conductor 
surrounded by a sheath of dielectric material. Due to the extreme 
length repeaters must be placed at intervals, the separation being deter- 
mined by the attenuation of the cable. The attenuation, a, of a coaxial 
telephone cable may be written 


a = (G/2)(L/C)' + (R/2)(C/L)* 
where G is the conductance of the dielectric per unit length, C the 
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capacitance, L the inductance, and #& the conductor resistance per unit 
length. The second term contributes about 99% of the attenuation. 
Considering only this term we deduce 


(1/a)(da/AP) & (1/R)(AR/AP) +. (1/2C)(8C/aP) — (1/2L)(@L/aP) 


Accurate knowledge of the coefficient (1/C)(@C/dP) is thus essential 
in designing very long cables which are to be exposed to high pressures. 

In evaluating dielectric materials for use in cables it is often desirable 
to make measurements on sheet specimens rather than cable. It thus be- 
comes necessary to be able to translate sheet data into cable data. It 
is the purpose of this paper to analyze the problem of calculating pres- 
sure coefficients of capacitance for certain simple geometries and to 
consider the methods of measurement which have been used. It will be 
shown that results of theory and experiment are in as good agreement 
as can be expected but more accurate measurements of electric and 
elastic properties are needed. 

The equations which will be derived are also necessary if one wishes 
to determine the dependence of dielectric constant on pressure using 
any of the geometries described herein. 

The problems treated in this paper are particularly simple and amen- 
able to mathematical treatment but many problems encountered in sub- 
marine cable design are at present subject to solution only by empirical 
means. Fundamental investigations of the effects of pressure on dielectric 
materials are needed. 


Il. THEORETICAL TREATMENT 


In the following treatment we consider that the dielectric substance 
is an elastic solid which obeys Hooke’s law. We denote the relative per- 
mittivity or dielectric constant by ¢, the permittivity of free space by 
& ,* the principal stresses and strains by 7;; and e;;, the density by p, 
the compressibility by k, and Poisson’s ratio by a. 


1 de 
A. Calculation of 


One of the quantities which will be needed in the evaluation of 
. i@e 
— 18 
CaP 
As € is not dependent on the geometric configuration it can be calculated 


* = 8.86 107! farads/meter. 
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in general and the result applied to each of the special cases to follow. 
A relation between dielectric constant and density is required and 
usually, when dealing with non-polar dielectrics, one assumes that the 
Clausius-Mosotti relation gives the proper dependence. That is 

e- 
—>5 = (constant) p . (1) 


e+ 2 


if 


This formula may be differentiated to give 


(2) 


In the theory presented herein, (1) will be used though it is at best an 
approximation. Corrections to the Clausius-Mosotti formula’ which 


have been given do not seem applicable to polymer dielectrics and in- 
troduce parameters which must be fitted. 


B. The effect of a One-Dimensional Pressure Acting on a Dise 


Consider a one-dimensional pressure, —P, acting along the axis of a 
circular dise of dielectric material with electrodes affixed to opposite 
faces. Assume the dise is constrained such that no lateral displacement 
van occur. Let ¢ be the thickness and A the area of the dise. 

The capacitance of such a capacitor is given by the equation 


so the desired pressure coefficient is 


1aC _lde 1 at 


CaP 2aP 


where use has been made of the condition that the area is constant (i.e., 
no lateral displacement). Hooke’s law states 


k 
— 2e) [r22 — + T2:)| (4) 
= 
k 


3(1 — 20) 


Crz 


[Ty + (5) 


Czz 


[r22 + Tyy) | (6) 


'C. J. F. Bottcher, Theory of Electric Polarisation, Elsevier Publishing Co., 
Amsterdam, 1952, p. 199 et. seq. 
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We assume the z-axis lies along the dise axis so 7,, = —P and by sym- 
metry tr: = Tyy. The condition of no lateral strain states e,, = Cy = 0 
which when combined with (4) gives 


Tyy I 


Using the last result with (6) we obtain 


Czz 


~~ paP~ 
Combination of (2), (3), and (7) results in 


1aC (e — 1)(e + 2) k (; 


C. The Effect of a Hydrostatic Pressure Acting on a Disc 
Assume that conditions are similar to those considered in section B 
except that ¢.. = yy = 0 is now replaced by 
Tes = Ty = TT, = —P. 
The area is no longer independent of pressure so 


10C 1 1 0A 
Aap 


CaP taP 
Hooke’s law, (4), (5), (6), now becomes 


ey = = (i = a, y, 2) 


+ Cyy + €22) 


+ 


2k 


3 


3 
a kP (1+ 
(9) 
Thus 
1 
A @P 10) 
1 
| 1 at _e 
(12) 
: 
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Combination of (2), (9), (10), (11), and (12) results in 


dC _ | te — + 2) 1 
CaP 


D. The Effect of a Radial Pressure Acting on a Cylindrical Annulus 


Consider a radial pressure acting normally to the axis of a cylindrical 
annulus to which electrodes are affixed to the inner and outer surfaces. 
Let the inner and outer radii be a and 6 respectively and assume the 
cylinder is filled with an incompressible substance so that the inner 
radius is not pressure dependent. The capacitance per unit length is 
given by 


1 ac l 1 1 db 
bbaP (14) 


CaP 
ht 
We employ cylindrical coordinates (r, 6, z) where the z-axis is taken along 
the cylinder axis. Hooke’s iaw becomes 

= 
3(1 — 20)” 
k 


3(1 — 2c) [To O(T rr + Tz2)| 


k 


— 2c) 


— o(ree + 7:2)] (15) 


— o(trr + 


Equilibrium of an arbitrary volume element demands” 


ta = 
72 


B 


(19) 
72 


m= A— 


where A and B are constants with respect to spatial coordinates. (A 


? J. Prescott, Applied Elasticity, Dover Publications, New York, 1946, p. 330. 


: 
50 
C00 (16) 
(18) 
an 
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should not be confused with the electrode area used in 
tions.) We assume 


previous sec- 
0 for all (r, 6, 2) (20) 

= QOforr=a (21) 

and P forr = b (22) 


Manipulation of (15) through (22) allows the evaluation of the con- 
stants A and B as 


a* 


— 20) 


— 20)P 
+ (1 — 20) 


p oP 
1 dp _ 21+ 


p oP 


yields 


bap 


Combining (2), (14), (23), and (24) we obtain 


G ol of (1 2c) 


(e 
) 


E. The Case ¢ = 3 


The equations derived above reduce to the expressions one would 
obtain if the dielectric were considered to be a compressible fluid when 
o is set equal to 4 


; 
23) 
2 
? 
a 
2 
(25) 
: 
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Equation (8) for the parallel plate arrangement becomes 
1 aC (e — 1)(e + 2) ak 
C at 
while (13) is unaltered. The difference between the two cases arises 
from the fact that in (13) the area was allowed to vary while in the former 
case it was not. The deviation of 
1 aC 
C aP 
from (26) when o # 0.5 is given by the factor 


1 l+o 


3\l—<e 
The capacitance-pressure coefficient for the cylindrical configura- 
tion, (25), becomes 


La (e — 1)(e + 2) 


CaP «Be 


The deviation of 
1 aC 
CaP 
from the value given in (27) when o # 0.5 is thus given by the factor 


9 


b 
a 


+ (1 — 20) 


9 


(1 + o) 


APPARATUS 


The experimental arrangement employed to investigate the validity 
of (8) is shown in Fig. 1.* Pressure was applied by means of a Baldwin 
tensile testing machine. The cell makes use of a “sandwich” arrangement 
wherein two dise samples (2” in diameter, 0.050” thick) of dielectric are 
pressed between three brass electrodes, the outer electrodes being 
grounded. The capacitance thus formed is well shielded and stray capaci- 


* This cell was designed by C. A. Bieling. 


= 
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tances are minimized. Lateral displacements are kept small by an annular 
ring of steatite ceramic which is in turn surrounded by a ring of Ketos 
steel. Pressures of 23,000 Ib on the two-inch sample dises have been 
applied without damaging the cell. 

Although measurements could be made with ease in this cell it is not 
without disadvantages. The steatite ring has a rather high dielectric 
constant which tends to increase fringing effects. These effects are fur- 
thermore pressure dependent since the electrode separation varies as 
pressure is applied. Also loss measurements could not be obtained as 
leakage along the steatite surface was larger than the leakage through 
the samples of the polyethylene-butyl rubber compound investigated. 

An attempt was made to eliminate fringing effects by making meas- 
urements on samples of varying thickness and extrapolating to zero 
thickness but results were too uncertain to be of quantitative value. 
The uncertainty resulted from the inability to cast the sample discs 
with uniform thickness an effect which becomes pronounced with very 
thin samples. It was possible, however, to estimate the total stray 
capacitance in this manner and it was found to be about 10 per cent of 
the sample capacitance and only slightly dependent upon pressure. 
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Fig. 1 Cell employed to measure (1/C)(aC/aP) with sheet specimens under 
one dimensional pressure. 
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Also, due to non-uniformity of the thickness of the specimens, it was 


found that the capacitance-pressure coefficient was larger at low pressures 


than at high pressures. This was apparently due to the initial squeezing 
out of voids between the electrodes and samples and the difficulty was 
removed by applying silver paint electrodes to the sample dises. 

A Western Electric capacitance bridge was used to make capacitance 
measurements with this cell. The temperature was kept at 25°C and 
the humidity of the room was maintained at 50 per cent. The frequency 
used was 10 ke. 

It is believed that the fact that the steatite is much more rigid than 
the specimens makes this experimental arrangement closely approximate 
the assumptions made in deriving (8) (i.e. lateral strains are negligible). 
It is important, however, that the specimens be cut to fit the steatite 
ring very closely. 

Experiments which correspond to the cylindrical capacitor under 
biaxial stress have been performed as follows.* The specimens in this 
case were lengths of cable which consisted of a copper wire central 
conductor surrounded by the polyethylene-5 per cent butyl rubber 
compound. b/a was 3.87 in most of the measurements but some data 
were obtained for b/a = 4.68 (actual dimensions 0.620”/0.160” and 
0.750”, 0.160”). Twenty foot lengths of cable were placed in a long tank 
provided with a seal at one end. The end of the cable inside the tank 
was closed such that the center conductor was isolated. The tank was 
then filled with water which served as the outer conductor, tap water 
having sufficiently high conductivity. Pressure was applied by the 
water. 

A Leeds and Northrup capacitance bridge was used and the measure- 
ments reported were made at 10 ke. 


IV. RESULTS 


It is experimentally observed in all the cases considered that plots of 
C versus P are nearly linear for polyethylene-5% butyl rubber. This 
may be shown to be in agreement with the foregoing theories as follows. 


Equation (8) may be written 


c(P) P 9 
awe =f dP (28) 


“0 


* The investigation of radial compression on eyvlindrical (cable) specimens was 
carried out by A. W. Lebert and O. D. Grismore of Bell Telephone Laboratories, 
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The integrand is approximately constant so 


C(O) 


C(P) = C00) p\ 


kP = 10~ for the highest pressures used in the present experiments and 


(e — 1)(e + 2) 


+1 


is of the order of unity so the exponential may be expanded as 


C(P) CO) 41 + |] 

3 
This treatment applies only to dielectrics for which ¢, /, and ¢ are in- 
sensitive to pressure. Equations (13) and (25) may be treated similarly. 
Values obtained experimentally and theoretically for polyethylene- 
5% butyl rubber are compared in Table I. The experimental values 
represent averages of many measurements. Agreement is considered 
adequate but more careful experiments are needed. The necessary param- 

eters assumed in making these comparisons are: 


2.28 
2.14 X 10 °/psi 
o = 0,50 


TABLE [ EXPERIMENTAL AND THEORETICAL VALUES FOR POLY- 


aC 


Sample Pressure } 


| Experimental Theoretical 


| 


Sheet .| one-dimensional 3.3" 
Cable = 3.87 radial 2 
b/a = 4.68. radial 2.22 


3. 


* This value has not been corrected for stray capacitance. Such a correction 
would tend to make the agreement between experimental and theoretical results 
better. 
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SUMMARY 


Equations relating electrical capacitance and pressure have been 
derived for plane capacitors under one dimensional and hydrostatic 
pressures and cylindrical capacitors under radial pressure. The dielectric 
material has been assumed to be an elastic solid but the relationships 
also apply to fluid dielectrics when Poisson’s ratio is set equal to 3. 
Experiments corresponding to the assumptions have been described 
briefly and experimental results are found to be in agreement with the 
theoretical predictions. 

The results are of practical value in making estimates of the de- 
pendence of attenuation of submarine cables on pressure. The equations 
may also be put in forms useful for determining the dependence of the 
dielectric constant on pressure from capacitance measurements. 
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The Effects of Surface Treatments on Point Contact Transistor 
Characteristics by J. H. Forster and L. E. Miller, B.S.T.J., 35, pp. 
767-811, July, 1956. Figs. 3, page 776, and 10, page 787, were inad- 
vertently interchanged. 


Cable Design and Manufacture for the Transatlantic Submarine 
Cable System by A. W. Lebert, H. B. Fisher and M. C. Biskeborn, 
B.S.T.J., 36, pp. 189-216. Table I, page 3, the material for type B 
armor wire should be medium steel instead of mild steel. Page 207, 
the equation for Zo should read 
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Reading Rates and the Information Rate of 
a Human Channel 


By J. R. PIERCE and J. E. KARLIN 
(Manuscript received August 31, 1956) 


The limitation on the rate at which information can be transmitted over an 
ordinary telephone channel is a human one. In this study people read words 
as fast as they were able to; from these results some deductions are made about 
the capacity of a human being as an information channel. The discrepancy 
between human channel capacity measured thus (40-50 bits/sec) and tele- 
phone and television channel capacity (about 50,000 bits/sec and 50,000,000 
bits/sec respectively) is provocative. 


INTRODUCTION 


In communication over an ordinary telephone channel, the limitation 
on the rate at which information can be transmitted appears to be a 
human one. For instance, by use of a vocoder, the required channel 
‘apacity can be reduced greatly with only a moderate reduction in the 
quality of the reproduced speech.! 

It would be of great interest to measure the information rate necessary 
to provide a satisfactory sensory input to a human being. It is not clear 
how this could be done. Something which may be related and for which 
a lower bound can be measured is the capacity of a human being as an 
information channel. 

An evaluation of and understanding of the limitations on the informa- 
tion rate of the human channel might ultimately be of practical im- 
portance for two reasons. First, it might help to tell us what sort of task 
to set a human being when he is necessarily a part of a system involving 
information transmission. Thus, a man can transmit information faster 
by reading than by tracking. Secondly, the understanding might some- 
what illuminate the problem of the channel capacity necessary to provide 
a satisfactory sensory input, and so might help to reduce the channel 
capacity required in electrical communication between human beings. 

Previous investigations indicate?» * that reading aloud attains the 
fastest rate at which a human being can be demonstrated to transmit 
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information, as contrasted with, for example, typing, playing the piano, 
or tracking.* 

The work presented here, while undertaken independently, is in 
general similar to and in agreement with that reported for reading rate 
experiments by Licklider, Stevens and Hayes,’ and by Quastler and 
Wulff. However, we have considered some factors in more detail than 
these workers, and also, contrary to the former group, we find that, 
under optimal conditions, reading with tracking has a lower information 
rate than reading alone. 

The chief problem investigated was: 

(1) Taking people as they are, with no additional training, how fast, 
in bits per second, can they transmit information by reading? 

(2) What principal factors control this limiting rate? 

The experimental procedure consisted simply of people reading aloud 
as rapidly as they could typed lists of words. Each list was composed of 
a single vertical row of 12 groups of 5 words, giving a total of 60 words 
per page. In each instance, the words were chosen at random from a 
given vocabulary of words. If n is the number of words in the vocabulary 
and if the words are chosen with equal probabilities, and if all words are 
read correctly,? the amount of information which is conveyed or trans- 
mitted through the human being measured in bits is‘ 


logs n bits/word 


When the vocabulary for a particular experiment has much fewer 
than 60 words, certain words must necessarily be repeated several times 
within a list. When the vocabulary is much greater than 60 words, repe- 
titions are necessarily few and differences in reading rate among different 
vocabularies would be expected only if the vocabularies differed in 
nature, as in syllable length or familiarity of words. 

Unless otherwise specified, each result quoted below is the average 
reading speed for two lists for each of three readers, chosen as repre- 
senting fast, medium and slow readers for people with at least a high 
school education. The results on these three readers are substantially 
similar to those on ten similar readers used in preliminary experiments. 
The chief experiments performed, and some interpretations of them, 
follow under numbered headings. Some supplementary experiments are 
then described briefly and the over-all results are commented on. 

* Here tracking means successively pointing to a series of marks. 


+ In preliminary experiments the reader’s voice was recorded, and it was found 
that errors in reading aloud oceur very seldom if ever. 


| 
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PRINCIPAL EXPERIMENTS 
Experiment 1: Effect of Vocabulary Size 


The larger the vocabulary size the higher the information rate con- 
veyed by a given word reading rate. However, one might think that it 
would be possible to read randomized lists of, say, 4 words substantially 
faster than lists of 8, or 16 or more words.* How is the word rate affected 
as the vocabulary size increases? 

To investigate vocabulary size as such, it is necessary as far as possible 
to avoid the influence of differences in word length or familiarity. To 
this end, words in each vocabulary were chosen at random from the 500 
most common words in the language;® a few words were then changed so 
as to keep an average of 1.5 syllables/word for each vocabulary. Figs. 
l(a) and 1(b) show parts of typical lists for vocabulary sizes of 2 and 
256 words respectively. The order of reading the different size vocabu- 
laries was randomized. 

Fig. 2 shows that reading rate is essentially independent of vocabulary 
sizes from 4 to 256 words when familiarity and word length are kept fairly 
constant. The reading rates for the three readers for the 256-word vo- 
cabulary are 3.8, 3.7 and 3.0 words/sec, giving information rates of 30, 
30 and 24 bits/sec respectively. 

The word rate for a 2-word vocabulary is systematically a little greater 
than for larger aera This effect, which is statistically significant, 
is best seen in Fig. 2 in the average curve (dashed). The writers feel on 
the basis of ealbeitien; impressions that this may result from a tendency 
to group words in pairs in recognizing and speaking them. Among 2 
words there are only 4 ordered pairs. It is apparent from the data that 
no such effect is noted among the 16 ordered pairs occurring with the 
4-word vocabulary. 

The last point on the curves in Fig. 2 illustrates the importance of 
familiarity and word length. When words are taken at random from a 
5,000-word dictionary (12.3 bits/word), the reading rates drop to 2.8, 
2.7 and 2.1 words/sec, yielding information rates of 34, 33 and 26 bits/sec 
respectively, which are very close to the rates 30, 30, 24 for the 256-word 
vocabulary. 

However, these dictionary lists involve some unfamiliar words and 
average 2.2 syllables/word. 


° When the light is very dim, the reading rate is slowed, and is faster for small 
vocabularies than for large voce: abularies. Reading tests were done at normal light 
levels, which are very much brighter than those at which a slowing due to inade- 
quate illumination is observed. 
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Experiment 2: Effect of Word Length and Familiarity 


It was not clear from Experiment 1 how much of the drop in word 
rate for the dictionary list was affected by decreased familiarity and how 
much by increased word length. 

These two variables were then untangled in a separate experiment. 
Word lists were prepared which kept both length and familiarity rela- 
tively constant for a given list. The words were chosen from a list of the 
20,000 most frequently encountered words in the language.® Reading 
rates were measured for the thousand most familiar words, for the ninth 
to tenth thousand most familiar, and for the nineteenth to twentieth 
thousand most familiar words. 

The results are shown in Figs. 3(a), (b), and (c). There is considerable 
consistency among readers as to the relative effect of length and famili- 
arity. The most familiar trisyllable words, for example, are read about 
as rapidly as the least familiar monosyllables. 

A confirmatory demonstration of the effect of familiarity upon reading 
rate is shown in Fig. 4. This shows reading rates for randomized lists of 
eight nonsense words averaging 1.5 syllables/word (e.g., jevhin, tosp) 
which are necessarily totally unfamiliar when the reader first encounters 
them. As the reader becomes more familiar with the words on successive 
readings, his word rate increases until he approaches the rates of familiar 
words in Fig. 2. 


Experiment 3: Preferred Vocabulary for Increasing Transmission Rate 


The transmission rate is the product of the reading rate and the 
logarithm to the base 2 of the vocabulary size. To maximize the rate we 


Fig. 1 — Parts of typical lists for vocabulary sizes. 
£ yp 


grew foot 
action tomorrow 
grew count 
grew issue 
action rain 


action month 
grew earth 
grew cook 
action build 
action corner 


grew yard 
action history 
grew forest 
action pleasant 
grew wrong 
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Fig. 2 — Reading rate is essentially independent of vocabulary sizes under 
certain conditions. 


must make each of these factors large. As Fig. 3 indicates, reading speed 
tends to decrease as vocabulary size increases. From the data in Fig. 3, 
a rationale (shown in Appendix I) was developed for use in searching 
for an improved vocabulary which would maximize transmission rate. 
This indicated that the 2,500 most familiar monosyllables chosen with 
equal probability should form a very good vocabulary and one which is 
simple to construct and use. For a 2,500-word list we have 11.3 bits/ 
word. 

Reading speeds for such preferred lists were 3.7, 3.4 and 3.0 words/ see, 
giving information transmission rates of 42, 39 
data on the distribution of this rate found among Bell Telephone Lab- 
oratories employees is given in Fig. 5. 


and 34 bits/sec. Some 


Experiment 4: Prose and Scrambled Prose 

The experiments above were all with discrete words. Reading rates 
for non-technical prose* are appreciably higher — 4.8, 4.7 and 3.9 
words/sec for the three readers. However, such prose has a good deal 
of redundancy. Shannon? arrives at a figure of around 1 bit/letter for a 


* Extracts were taken from New York Herald Tribune, the novel ‘‘East River” 
by Sholem Asch, ‘‘Vermont Tradition’? by Dorothy Canfield Fisher and the 
Scientific American. Such material was chosen as being of the same sort of prose 
as was used by Dewey® in his word counts from which Shannon’? made his estimate 
of information content of printed English. 
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27-word alphabet including the space, or 5.5 bits/word for the average 
of 4.5 letters/word plus one space following a word. Newman and 
Gerstman® give a figure of 2 bits/letter. It is quite uncertain, however, 
what the true value may be. Table I compares the information rate for 
the preferred list with that for prose assuming 5 and 10 bits/word. 

When words were taken at random from the same prose sources, the 
reading rates dropped to 3.7, 3.3 and 2.7 words/see. These rates are 
about the same as for the preferred list. 

The information content of scrambled prose can be estimated much 
more accurately than that for prose, since the correlations associated 
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Fig. 4 — Confirmatory demonstration of the effect of familiarity upon reading 
rate. 


with word order have been removed and the bits/word depend only on 
the frequency of occurrence of words in prose, which is known. Thus, 
Shannon’ gives a figure of 11.82 bits/word which applies to scrambled 
prose, provided the prose has the same word frequencies as that from 
which the statistics were derived. The information rates for words from 
a 5,000-word dictionary (Experiment 1) for the preferred lists, and for 
scrambled prose are given in Table IT. 

The information rate for scrambled prose is less reliable than the 
others, because we are not sure that the word frequencies used by 
Shannon apply to the prose used by us, but we used the type of material 
cited by the reference he quotes. It is clear that the information rate 
for scrambled prose is high as compared with most other lists. 

Table II shows the gain which may be made by fitting the task to the 
human being -— in this case, by choosing a suitable word list. We may 
note that the gain appears greater in the case of reader A than in the 
case of reader B. This need not be experimental error. One would sup- 
pose that there are optimal lists for individuals. Indeed, if we compare 
Figs. 3(a) and 3(b) we see that for reader A the word rate for mono- 
syllables drops by a factor 0.72 in going from the first thousand to the 
tenth thousand, while for reader B the drop is only a factor 0.88. This 
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Fig. 5 — Distribution of reading rates for preferred vocabulary. 


indicates that the optimal list would be somewhat different for reader A 
than for reader B. More extensive data would, however, be required to 
confirm this hypothesis. 

Experiments not described here in detail showed that reading rates 
for digrams (successive pairs of words related as in English text) are 
intermediate between those for prose and discrete words. 


Experiment 5: Effect of Multiple Channels 


Licklider' has found that when the reader attempts simultaneously to 
perform a tracking operation while he is reading, his reading rate re- 
mains almost unimpaired, and the tracking information is added to 
that of reading alone. This two-channel transmission gave him his 
highest rate of transmission. We obtained the reverse finding. Reading 
the preferred list gave us our highest transmission rate. Simultaneous 


TABLE I 


Information Rate (bits/sec) 


Material 


Proge (6 bits/word).... 24 23 19 
Prose (10 ......... 00050006. 48 
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TABLE II 


Information Rate (bits/sec) 


Material 


5,000-word dictionary. ... 
Preferred list 
Scrambled prose 


reading and tracking gave a lower total transmission rate. However, 
Licklider and we agree on the magnitude of this maximum — between 
40 and 45 bits/sec for facile test subjects. 

Measurements on combined reading and tracking rates were made in 
Experiment 5 using words from the preferred lists. Whereas Licklider’s 
readers made a dot within a box next to the word read, our readers 
placed a dot as close as possible to a vertical line next to the word read 
(e.g. dog |-). The computation of transmission rate is shown in Ap- 
pendix II. The reading-while-tracking rates were 2.4, 2.0 and 1.4 words 
sec. The computed information rates are given in Table III. 

It may be seen that the reading rate during tracking dropped so much 
that the two channels together give a total information rate less than 
those for reading the preferred list alone. Licklider’s reading lists were 
words chosen randomly from a dictionary and are presumably not chosen 
optimally for maximum information rate ——his information rates for 
reading alone were 30-35 bits/sec, as compared with the 32-43 bits ‘sec 
found here for the scrambled prose and preferred lists. However, if we 
assume that our reading-while-tracking rate, which is much slower than 
the reading rate for scrambled prose or for the preferred lists, is limited 
largely by tracking, we might have obtained a slightly higher informa- 
tion rate in reading-while-tracking by using a larger list of words. This 
is suggested by the fact that Licklider’s and our experiments obtain 
about the same reading-while-tracking speeds. 


TABLE III 


Information Rate (bits/sec) 


Reading (while tracking) 

Tracking (while reading) 

Reading and Tracking 

(Rates for same word list from Experi- 
ment 3 — reading only) 


37 3 33.1 at. 
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Experiment 6: Effect of Physiological Utterance Limitations 


One of our best indications that the maximum reading rate of a 
subject is determined by mental rather than by physical limitations is 
that discrete word lists were read no faster silently than aloud. This 
may appear contrary to very high silent reading rates widely quoted. 
This can be explained by the fact that in reading much prose we do not 
and need not recognize every word in order to get the sense. Presumably, 
if an author made every word say something, his prose could not be 
read with understanding at such high rates. 

We can also show in another way that the mere uttering of the words 
does not determine the reading speeds observed. A memorized prose 
phrase (‘This is the time for all good men to come to the aid of their 
country’’) was repeated several times at rates of 7.5, 9.1 and 8.4 words / 
sec for the three readers. 

Fig. 6 compares word rates for repeating a phrase with the word rates 
previously discussed. The radically faster rate for repeating a phrase is 
not the only feature to be observed in this figure; the three readers are 
not in the same order of speed as is preserved through the reading 
experiments. This would suggest that it is word recognition rather than 
speaking speed which accounts for differences among the reading rates 
of different people. 
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READING RATES 
DISCUSSION AND SUPPLEMENTARY EXPERIMENTS 


Conclusions from Principal Experiments 


The conclusions which can be reached with reasonable assurance from 
these experiments are rather narrow. They might be stated: 

1. Information is best transmitted through a human channel by 
means of well-chosen acts (reading well chosen words in this case) in- 
volving many bits per act, that is, much choice per act. Cutting down 
drastically the bits per act does not substantially increase the speed at 
which the individual act is accomplished. 

2. The lower bound of information transmission through the human 
channel of rapid readers seems to be about 43 bits/sec. This estimate is 
a little higher than that found by Licklider,? and may be close to a 
limiting rate. 


3. This limiting rate can be achieved by the simple act of reading 
either randomized lists from suitably selected words or scrambled prose. 


4. Both familiarity and length of words are important in determining 
reading speed. The relative effect of these two variables on reading speed 
is rather complex. 

Beyond these narrow conclusions, there is much understanding yet to 
be achieved in the general field of the speed of human mental and 
physical responses and operations. Thus, it seems worth while to men- 
tion other experiments which were done in the course of the present 
investigation and experiments carried out by other workers, and to 
speculate somewhat concerning the whole of this experimental work. 


Multiple Tasks 


The reading-while-tracking experiments touch on an important prob- 
lem. We have all heard of wireless operators who can receive and sub- 
sequently type out a message while carrying on a conversation or playing 
chess. There is nothing in this feat to indicate an information rate greater 
than that we have found. Actually, the rate of receiving prose by Inter- 
national Morse Code by ear is around 0.58 word/sec ;° this is slow com- 
pared with the rates we have considered. 

Our experiments with tracking followed experiments in which words 
in the lists were randomly printed in red or black, and in which the 
subject spoke red words in a louder tone of voice than black words, or 
pressed one key for red words and another for black words. In these 
cases, the added information, one bit per word, was so small as to make 
no clearly discernible difference in information rate for the large vo- 
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cabularies. The speed for reading loud and soft was less than for reading- 
while-keying. This may imply something about the relative efficiency of 
human beings performing two tasks by using two sets of muscles as 
against using one set in two different ways. 

It is common experience that we can walk about and carry out other 
simple tasks while talking or thinking. It is possible though not obvious 
that some sort of automatic, almost purely reflexive response — as, 
moving the left hand when the right hand is touched — could with 
practice be carried out quite independently of a task such as reading. 
The information rate for such responses would be small, the experimental 
error would make it difficult to settle the question, and the interpretation 
of such an experiment would not be entirely clear. 


The Patterns Which Govern Reading Time 


Early in the experiments the question was raised whether readers may 
not read letter by letter or syllable by syllable. Several findings bear on 
this. 

Fig. 3 shows clearly that the reading time for a two-syllable word is 
much less than twice the reading time for a one-syllable word. 

One of us knows a negligible amount of German. German syllables are, 
however, reasonably familiar. It was found that in reading German 
aloud he had the same reading rate in syllables per second as a man 
whose native language was German had in words per second. The two 
readers had substantially the same reading speed in English. Presumably 
in reading German one man recognized syllables and the other recog- 
nized words. This also reinforces the conclusion that reading rate is not 
limited by the time taken to utter words. 

Some experiments were done using lists of common Chinese characters 
and lists of the corresponding English words. Average word rates over 
three lists for two readers who could read both languages are given in 
Table IV. The slightly lower rate for English is plausibly explained by 
the fact that Chinese was the reader’s native language. All words were 


TABLE IV 


Words/sec 
Reader 
Chinese Words English Words 


\ 
| 
27 
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READER 


SECONDS PER 60 WORDS 


2 
SYLLABLES PER WORD 


Fig. 7 — Patterns governing reading time. 
necessarily monosyllables in Chinese and happened to be monosylla- 
bles in English. 

In one case a word is made up of a sequence of letters, each standing 
for a sound, and in the other it is made up of a number of strokes which 
are meaningless individually, yet in each case a word is taken as a unit 
or pattern requiring nearly the same time for reading. 


We found that the rate for reading arabic numerals is substantially 


the same as for reading familiar words. Each numeral is an individual 
pattern to be recognized. 

In a first effort to find the effect of syllable length on reading rate, 
a subject read several lists made up respectively from vocabularies of 
16 single-syllable, 16 two-syllable, 16 three-syllable, and 16 four-syllable 
words. None of the words was very unfamiliar to start with, and all were 
presumably very familiar after the subject had read several randomized 
lists composed of the same words. 

The outcome of the experiment is shown in Fig. 7. For comparison, 
the points associated with the lower straight line are time for 60 words 
for repeating, as rapidly as possible, a one-, a two-, a three- and a four- 
syllable word. The points on the upper curve are reading time for 60 
words for the randomized lists of the familiar one-, two-, three- and four- 
syllable words. 

In dealing with such groups of highly and uniformly familiar words, 
it appears that, roughly, a certain time is required to recognize the word 
regardless of length, and this time governs the reading rate up to the 
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TABLE V 


Words/sec 
Reader 


Scrambled Prose Scrambled Paragraph 


3. 
3. 


2. 


point at which the reader is uttering words continuously as fast as he 
can. This is consistent with a strong subjective feeling that what limits 
the rate is the difficulty of ‘‘recognizing’”’ the word as one looks at it, 
and that once the word is recognized one can utter it while recognizing 
the next word. 

It would of course be wrong to conclude from this experiment that 
multisyllable words are in general recognized as quickly as single syllable 
words, for it would be possible to recognize one among a known group 
of 16 multisyllable words without looking at the whole word. Indeed, 
Fig. 3 indicates a substantial difference of reading rate between one- and 
two-syllable words of like frequency of occurrence. This was not ob- 
served in reading the specially familiar lists of one- and two-syllable 
words, 

Why is prose read faster than scrambled prose? It might be that some 
short phrases are recognized as individual patterns. However, there is 
another factor at work. A scrambled paragraph of prose is read slower 
than the same paragraph in its natural word order but faster than 
scrambled prose from a book or a long stretch of prose, as can be seen 
from Table V. 

It should be noted that reading speed differs for different prose, and 
that when comparisons among prose, scrambled paragraphs and scram- 
bled prose are made, similar material should be used. 

The fact that a scrambled paragraph is read faster than scrambled 
prose might be explained by saying that we expect, we are more ready 
to recognize, words which are repetitions of earlier words or words which 
are closely related in sense to earlier words than we are unrelated words. 
Thus, the greater reading speed for prose than for scrambled prose seems 
to be due only in part if at all to the recognition of phrases rather than 
words as individual patterns. 


Rate of Mental Processes 


The rate at which information passes through a human channel in 
reading experiments is indisputable. Quastler? has attempted to go 
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beyond this and estimate information processing rates in the brain from 
the performance of lightning calculators, by dividing the performance 
of the calculation into a sequence of tasks equivalent to consulting 
memorized multiplication tables and performing additions. It is hard to 
interpret such a study clearly, for it is quite possible that there are many 
sorts of mental acts which take different times to perform, just as multi- 


plication and addition take different times in an electronic computer. 
A tentative experiment we performed indicated something of the sort. 

Randomized lists were made up from vocabularies (a) of names of 
common animals and vegetables in equal numbers, and (b) of common 
men’s and women’s names in equal numbers. In reading these, a subject 
was asked, not to read the word aloud, but merely to press one key with 
his right and another key with his left hand; in (a) left-animal, right- 
vegetable; in (b) left-man, right-woman. The same subject later read the 
lists aloud. Pressing keys took 40 per cent longer than reading aloud. 
(The additional time is not related to the keying operation itself; for a 
2 word list, for example, keying speed is much faster than reading speed.) 
Presumably an additional mental operation was involved, but it was not 
one for which the time was equal to that for reading. This experiment 
was not pursued further, partly because no clear conclusion could be 
drawn from it. Had it been pursued and randomized lists of the same 
words used repeatedly, the rate might have gone up. Conceivably, cow 
and horse could become for a subject merely different ways of spelling 
left, and lettuce and carrot variant spellings of right. In this case we 
would end with a two-word reading experiment. 


Reading Rate as a Psychometric Datum 


It is interesting to speculate on the possible relationship of reading 
rate to general intelligence or some other aptitude. Certainly, Fig. 5 
indicates some such relationship. We might also ask in connection with 
Fig. 3, does a rate which falls less rapidly with frequency of occurrence 
indicate a larger vocabulary, and can we measure vocabulary by reading 
speed tests? Certainly, measuring the speed of reading aloud is very 
simple, and such tests might have some psychometric utility. 


The Channel Capacity Required for Satisfactory Communication 


In conclusion, we cannot help but wonder that the highest information 
rate noted — 43 bits/seeond-—-is so much lower than the channel 
‘rapacity* of a telephone or a television circuit (around 50 thousand 


* This is the limiting channel capacity given by (2.1) of Appendix II. The prac- 
tical rate at which binary digits can be sent over a telephone circuit with simple 
equipment is less than 1000 bits/second. 
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bits/second for telephone and 50 million bits/second for TV). This 
would not be surprising if the limitation we observe had been one of the 
speeds at which words can be uttered, but it appears rather to be a 
mental one, one of recognizing what is before the eyes. To the authors, 
it seems reasonable that this mental limitation may apply to a human 
being’s ability to absorb information, that is, to the information rate 
needed to present a satisfactory sensory input to a human being. If it 
does, then why do we need so much channel capacity to convey to him 
an acceptable sound or picture? 

This can be explained in part by the inefficiency of our present com- 
munication methods. Despite its present imperfections, the vocoder 
makes it clear that clearly understandable speech can be transmitted | 
using far less channel capacity than that required in ordinary telephony.! 

However, it is quite likely that even with the most efficient of en- 
coding means we will have to use far more than 43 bits/second for a 
picture transmission channel. While only a portion of the image of the 
transmitted picture falls on the fovea at any instant, we can cast our 
eyes on any portion of the received picture. If the pick-up camera device 
and the received picture followed eye movements, a much less detailed 
picture would serve. Even with our eyes fixed, we can concentrate our 
attention on a particular part of our field of vision, and this is something 
that the pick-up camera cannot track. There may be similar effects in 


our apprehension of sounds. 

In the light of present knowledge it is impossible to estimate the 
minimum channel capacity required to transmit sound and pictures in a 
satisfactory manner. It will take work far beyond the measurement of 
reading rates to enable us to make such an estimate. 
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APPENDIX | 


ON OBTAINING GOOD VOCABULARIES 


The experiments in the body of the paper indicate that the choice of a 
good vocabulary is important in attaining a high information rate in 
reading lists of words. 
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In these experiments the randomized lists may be regarded as an in- 
formation source consisting of a sequence of code elements or symbols 
(words) in which there is no correlation between successive symbols. If 
in making up the lists the sth word of the vocabulary is used with a 
normalized probability p, , the entropy in #/ in bits per word, and hence 
the amount of information per word, is 


H = -> ps loge p, bits (1.1) 


If all words appear with an equal probability 1/m where m is the number 
of words in the vocabulary, as in the case of experiments 1-3, p, is 1/m 
for each of the m words and in this special case 


IT = loge m (1.2) 


In the case of scrambled prose, for instance, the probabilities are 
different for different words. This will be true also if in making up word 
lists we choose words randomly from a box containing different numbers 
of different words. 

Let ¢, be the time taken to read the sth word of the vocabulary. Let 
us assume that t, is the same for the sth word no matter what context that 
word appears in in the randomized list. If this is so, the average reading 
time per word, ¢, will be 


t= > pit. (1.3) 


the word rate will be 1/t, and the information rate R will be 


H Ds Ps 
t Dd 


(1.4) 


Suppose we have available a vocabulary of words and know the 
reading time ¢, for each word. The problem is to choose p, in terms of ¢, 
as to maximize PR. This is easily done; however the result can also be 
obtained as a special case of the problem treated in Appendix 4 of “The 
Mathematical Theory of Communication.’* In Shannon’s ¢;;, the sub- 
scripts 7, j refer to passing from state 7 to state j. In our case there is 
only one state, and ¢;;" should be identified with ¢, for all i and j, 
Similarly, we identify p,;“° with p,. C is the maximum rate, so logs 
W = C. Shannon’s equation 

B; 


B. W 
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becomes 


since there is only one B. 
Shannon’s determinantal equation 


ws” — = 0 


becomes 


(1.6) 


In (1.5) and (1.6) we have a means of evaluating p, in order to attain 


the maximum information rate R. 

The data we actually have concerning words is that for some class s 
of words, say, the monosyllables in the 8,000-9,000 words in order of 
familiarity, the reading time has some value ¢,, presumed to be the 
same for all words in the class, and that there are NV, words in this class. 
In this case we must assign to each word in the sth class the same prob- 


ability p, given by 


(1.7) 


and we must have 


= = 1 


Using the same amount of data given in Fig. 3, for the 20,000 most 


common words, but for a different reader, estimates were made of .V, 


and ¢, for all the classes consisting of words of each number of syllables 

in each range of occurrence of 1,000 words. Then the optimum values of 

p, for words in each class and the maximum rate /? were computed. 
Using (1.4), rates were also computed for choosing words with equal 


probability from among the first m thousand words and from among 


the first m thousand monosyllables, as functions of m. These rates had 


TABLE VI 


Nature | 
| 


Maximum for equi-probability among words of all lengths 
(first 5,000 words) 30.2 
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maxima for vocabularies of optimal sizes. Table VI compares the various 
rates computed. 

As it is much easier to make up lists from the 2,500 monosyllables 
among the first 8,000 words with equal probabilities than it is to make 
up lists from among all words with a different probability for each class, 
and as the information rates computed were close together, the former 
alternative was chosen. 

The use of scrambled prose provided an easy way to make up good 
lists. 


APPENDIX II 
TRACKING EXPERIMENT 


A well-known formula for channel capacity R in bits, sec is* 


2 
R = B loge{ 1+ (2.1) 
n 
This gives the limiting rate at which information can be transmitted 
over a channel with a bandwidth B by a signal of power P, , in the pres- 
ence of a gaussian noise of power ?,, , with an error rate smaller than any 
assignable number. 

In most cases, the actual rate is much smaller than this limiting rate. 
In general, the rate is the entropy of the received signal minus the 
entropy of the noise. In the particular case of a gaussian signal source as 
well as a gaussian noise, each represented by 2B samples a second, the 
calculation based on entropies gives exactly (2.1). Let us then apply 
(2.1) to the tracking experiment. 

Suppose that a large number .V of samples do have a gaussian dis- 
tribution of mean square amplitude x?. Suppose that we make an error 
d, in reproducing the nth sample, that these errors are gaussian, and 
that the mean square error is a? 


We see from (2.1) that ideally we can use these reproduced samples 
to transmit J/ bits of information where 


9 2) 


M = loge (1 + 


In the reading and tracking experiments, randomized words from the 
2,500 commonest monosyllables were arranged with equal vertical 
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spacings but with various horizontal positions. ‘To the right of each word 
Was a short vertical line. The distances wx, of these lines from the vertical 
centerline of the paper were obtained from a list of random numbers 
with a gaussian distribution such that for the list 22 = 1 inch. Of course, 
w for each list would depart from this value. As the words were read, the 
reader used a pencil to make a dot as near as possible to the correspond- 


ing vertical line. For each sheet, the departures d,, from the vertical 
lines in inches were measured and d? was computed. The number of bits 
M for pointing for that sheet were then taken as 


4 


\ l 
M = loge 1+ - 
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Binary Block Coding 
By S. P. LLOYD 


(Manuscript received March 16, 1956) 


From the work of Shannon one knows that it is possible to signal over an 
error-making binary channel with arbitrarily small probability of error in 
the delivered information. The effects of errors produced in the channel are 
to be eliminated, according to Shannon, by using an error correcting code. 
Shannon’s proof that such codes exist does not provide a practical scheme 
for constructing them, however, and the explicit construction and study of 
such codes is of considerable interest. 

Particularly simple codes in concept are the ones called here close packed 
strictly e-error-correcting (the terminology is explained later). It is shown 
that for such a code to exist, not only must a condition due to Hamming be 
satisfied, but also another condition. The main result may be put as follows: 
a close-packed strictly e-error-correcting code on n,n > e, places cannot 
exist unless e of the coefficient vanish in (1 + x)*(1 — x)" !* when this is 
expanded as a polynomial in x. 


I. INTRODUCTION 


In this paper we investigate a certain problem in combinatorial 
analysis which arises in the theory of error correcting coding. A develop- 
ment of coding theory is to be found in the papers of Hamming! and 
Shannon’; this section is intended primarily as a presentation of the 
terminology used in subsequent sections. 

We take (0, 1) as the range of binary variables. By an n-word we mean 
a sequence of n symbols, each of which is 0 or 1. We eall the individual 
symbols of an n-word the letters of the n-word. We denote by B, the 
set consisting of all the 2” possible distinct n-words. The set B, may be 
mapped onto the vertices of an n-dimensional cube, in the usual way, 
by regarding an n-word as an n-dimensional Cartesian coordinate ex- 
pression. The distance d(u, v) between n-words u and v is defined to be 
the number of places in which the letters of « and v differ; on the n-cube, 
this is seen to be the smallest number of edges in paths along edges be- 
tween the vertices corresponding to vu and v. The weight of an n-word x 
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is the number of I’s in the sequence uw; it is the distance between u and 
the n-word 0O---0, all of whose letters are 0.* 

A binary block code of size K on n places is a class of K nonempty dis- 
joint subsets of Bb, where in each of the K sets a single n-word is chosen 
as the code word of the set.t Each such set is the detection region of the 
code word it contains, and we shall say that any n-word which falls in a 
detection region belongs to the code word of the detection region. The 
set consisting of those n-words which do not lie in any detection region 
we call limbo.f A elose packed code is one for which limbo is an empty 
set; i.e., a code in which the detection regions constitute a partition 
(disjoint covering) of B,, . 

A sphere of radius r centered at n-word w is the set [v:d(u, v) < rj of 
n-words v which differ from wu in r or fewer places. A binary block code is 
e-error-correcting if each detection region includes the sphere of radius ¢ 
centered at the code word of the detection region. We say that a binary 
block code is strietly e-error-correcting if each detection region is exactly 
the sphere of radius e centered at the code word of the detection region. 

This paper is devoted to the consideration of close packed strictly 
e-error-correcting binary block codes. We shall refer to such a code as 
an e-code, for brevity. Hamming’ observes that a necessary condition 
for the existence of an e-code on 7 places is that 
l+n+- 


1) + (1) 


be a divisor of 2". In this paper we derive an additional necessary condi- 
tion. Our condition includes as a special case a condition of Golay* for 
the existence of e-codes of group type, and applies to all e-codes, whether 
or not they are equivalent to group codes.§ 
° * Tf B, i is regarded as a a subset of the real linear veetor space consisting of all 
sequences a = (a; ,a@2,°+* ,a,) of nreal numbers, then the “weight” of an n-word 
n 


is simply the norm all, = > la, |), and our ‘distance’ 


is the 
metric derived from this norm. 

+ The term “block code’’, due to P. Elias, serves to distinguish the codes of 
fixed length considered here from the codes of unbounded delay introduced by 
Elias, Reference 3. 

t In a communications system? using such a code, the transmitter sends only 
code words. If, due to errors in handling binary symbols, the receiver delivers 
itself of an n-word other than a code word then: (a) if the n-word lies in a detec 
tion region, one assumes that the code word of the detection region was intended; 
(b) if the n-word hes in limbo, one makes a note to the effect that errors have 
occurred in handling the word but that one is not attempting to guess what they 
were. 

The terms “group alphabet’? (Slepian®), ‘‘svstematic code’? (Hamming?!), 
“symbol code”? (Golay'), ‘check symbol code”’ (lias*), ‘parity check code’’, are 
roughly synonymous. More precisely, a group code is a parity check code in which 
all of the parity check forms are homogeneous (‘‘even’’), so that 00 +--+ 0 is one 
of the code words; see Reference 5. 
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If, DISTRIBUTION OF CODE 


WORDS 


Suppose an e-code on n places is given. Let us inquire as to the dis- 
tribution of weights of code words. We denote by », the number of code 
words of weight s, 0 S s < n, and by 


(2) 


the generating function for these numbers, with 2 a complex but other- 
wise free variable. We show in this section that G(x) satisfies a certain 
inhomogeneous linear differential equation of order e. 

If there exists an e-code on n places then this differential equation will 
have G(x) as a polynomial solution; the necessary condition for the 
existence of an e-code on n places given in Section 4 is essentially : 
restatement of this fact.* First, however, we must derive the differential 
equation and obtain its solutions. 

If w. is a code word of the given e-code (1 < a S< K), define the set 
of j-neighbors of wa as the set of n-words which lie at distance exactly j 
from wa ; designate this set by Sj;(wa). (So(wa) is the set whose only 
element is w, itself.) Our derivation is based on the observation that, in 
an e-code on n places, 


U 


a=1 j=0 


i Co 


S;(wa) = B, t (3) 


is a partition of B, . For, the detection regions: 


e 
U S,(w.), 

j=0 

are disjoint, and in each such sum representing a detection region the 
summands are disjoint (the distance function being single valued). 
Furthermore, each n-word of B, lies in some detection region (close 
packed property) and hence appears in one of the sets S;(w.) for some 
a and for some / satisfying 0 S j S e. 

The set 


S;(wa) 


a=l 


* The author is not yet able to demonstrate the converse. That is, suppose one 
obtains a polynomial solution G(z) of (11), below, satisfying appropriate boundary 
conditions, and from it some coefficients v, ,0 < s < n. It does not follow from 
the methods of this article that there is actually some e-code on n places for 
which these », represent the number of code words of weight s. 

+ U = set union. 
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consists of the n-words which are j-neighbors of some (not specified) 
code word; let us refer to these n-words simply as j-neighbors. Denote by 
vj,, the number of j-neighbors which are of weight s (with »,, = »,, as 
above). Applying (3) to the n-words of weight s, we see that 


8 


Vs + Vi + + 0 n (4) 


is the total number of n-words of weight s. If we multiply (4) by 2° and 
sum on s, we have 
G(x) + Gila) + +++ + Gx) = (1 + 2)" (5) 


where 


G(x) = Vj (6) 
s=( 


is the generating function (with respect to s) for the numbers v;,. . 

We now express G;(x), 0 < 7 S e, in terms of G(x). Suppose code 
word w is of weight s; that is, w consists of s ones and n — s zeros in 
some order. A j-neighbor of w is obtained by choosing j places out of n 
and changing the letters of w in these places, 0’s to 1’s and 1’s to 0’s. If, 
in this procedure, qg of the 1’s of w are changed to 0’s, so that 7 — q of 
the 0’s are changed to 1’s, then the resulting j-neighbor of w is of weight 


s— q+ (j — q). Now, there are . ways of choosing q places among 


n—s 
the s where the letters of w are 1, and there are independently ¥ ‘) 
ways of choosing 7 — q places among the n — s where the letters of w 


are 0. Thus, of the 6 different j-neighbors of w, the number (*) 


n—8 
. ‘) are of weight s + 7 — 2q. We may regard each of these as con- 

tributing 1-2°'’ “* to the generating function G(x) of (6) (provided 
0 <7 S e, so that there is no overlap); hence, summing over all j-neigh- 
bors of a code word and then over all code words, 


Gj(z) = (:\(" 0s ¢* (7) 
From the easily verified polynomial identity 
j=0 \G/\J — 4 


* The limits (0, x) on the g summation are merely for convenience; the bi- 
nomial coefficients vanish outside the proper range, under the usual convention. 
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(n, s integers, 0 < s < n) it follows that 


4 Jeo yi 


where contour C is, say, a small circle around the origin, taken posi- 
tively. Thus 


Ge) = Je dy 
_ 1 f (8) 
its 
where the operator L; is thus defined. Change of integration variable 
gives 
_a- a)" G(z) dz 
= c, (1 — — 
ttl af 
(1 G(z) (9) 
q! 02? (1 — | 
(" 4 a> ?(1 — 2°)? d?G(z) 
p=0 p! dx? 


(with C, a small circle enclosing x but not 2", x” # 1). Thus L; may be 
regarded as a linear differential operator of order j, (io = 1). 
Using this result, (5) may be given the form 


(1 + x)” = (Lo + L, + + LJG(z) 


1 fy anf t+ | 
= ( 
(1 + ry) a(; + xy dy 10) 
= MG(z) 


this last expression as a definition of operator 17. Written asa differential 
equation, (10) is 

e 2\p e—p P, 
(1 — 2)’ > (" d’G(x) 
r=0 r da? 


It is straightforward that the only singularities of this equation are 
regular singularities’ at ¢ = +1, ~. 


= (1+ 2)” (11) 


Ill. THE DIFFERENTIAL EQUATION 


In this section we discuss (11) without reference to the fact that G(x) 
is supposed to be a generating function. That is to say, with n and e 


|. 
a 
if 
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fixed but arbitrary non-negative integers, we denote by 


G(x) = »,2° (12) 
s=(0 


any solution of (11) regular in the unit circle. 
It proves convenient to introduce certain functions f,,¢(v) defined by 
faa(z) = (1 + 
(13) 


= 


s=0 


where the coefficients ¢,(n, &) are given by 


—, -{n—€& 
r=( r =F 


Here, £ is to be regarded as a free complex variable. By (1 + .x)§(1 — x)""§ 
we mean exp (£ log (1 + wx) + (m — &) log (1 — 2)), each logarithm 
vanishing at « = 0. As a function of 2 this function is single valued in, 
say, the x-plane cut on (— x, —1] and [1, ~), and the series (13) con- 
verges to it in: < ] 


Binomial coefficients are defined by 


= 
s} sIr(¢ + 1 — s) 


s> 0 


s 


when ¢ is not an integer, and ¢,(n, €) is seen to be a polynomial] in & of 


degree s: 


The recurrence relation 
go(n, + gi(n, &) + + ¢,(n, = — 1, &) (16) 
obtained by expanding the various factors [| in the identity 

(a + — 2)" — = + 2)" 
is an important one. We note also for reference that 


¢(n,n — = (—1)"¢,(n, é) 


g.(n,n) = (17) 


go (n, 


8 


(15) 
| 
: 
ite 
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valid for all n, € and non-negative integers s. We see, by the way, that 
¢-(n — 1, n) is simply the Hamming expression (1). 
The function f, ¢(a@) has the property that 


Jn + ry (1 + xy)” 


at least if, say, given x, | y | is small enough. From this and (8) for the 
operator 1; it is apparent that 


Lifn,e(x) = ¢i(n, (19) 
Similarly, using (19) and (16), or directly from (10) for the operator 1/, 
Mfne(x) = [Lo + La + + Ledlfne(x) 


If &3 is one of the roots of the polynomial ¢,(n — 1, &) then (20) be- 
comes 


(20) 


M( + — x)” = 0 


If we assume for the moment for simplicity that ¢.(n — 1, &) has e 
distinct roots 3, 1 S 8 S e, then (11) has as complementary function 


Ag(1 + x)*(1 — 
=] 


where the Ag are e arbitrary constants. 
Fortunately, the function (1 + x)" = f,,,(xv) is also a member of the 
family (13); hence 
M(1 + x)" = — 1, n)(1 + 2)” 
and the function 
2)" 
¢e(n — 1, n) 
is a particular integral of (11). [We see from (17) that ¢.(n — 1, n) does 
not vanish in cases of interest.| Finally, when the roots of ¢.(n — 1 
are distinct, the general solution of (11) must be of the form 
B=1 


1, n) 


=) 


If g-(m — 1, &) has multiple roots then the general solution will con- 
tain additional terms 


‘ 
(1 + x) (1 
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i.e., the derivative of with respect to with any positive 
integer less than the multiplicity of root &% . 

Before applying these results to e-codes in detail, let us derive a cer- 
tain modification of (21). First, we see from (17) that if m is a positive 
integer, then » is not one of the roots of g(a — 1, &). If the roots &3 
of g(n — 1, &) are distinct and if A3, 1 S 8 S e, are any e numbers 
then a polynomial @(¢) of formal degree ¢ is uniquely determined by the 
e + 1 conditions: 


= — nee (n — I, 
= 


using, e.g., the Lagrange interpolation formula. It is obvious that G(x), 
(21), may be expressed in terms of this polynomial as 


Gir) = + 2x) (1 x) 


— = 3 (24) 
2rivr — n)y.(n — 


where T is any simple closed contour surrounding the roots: n, & , &, 

- , € of the denominator of the integrand; (the numerator is an entire 
function of € provided 2° # 1). 

Analysis a little more detailed shows that even if ¢(m — 1, &) has 
multiple roots the general solution of (11) can be represented in the 
form (24), again with 6(£) any polynomial of formal degree e such that 
A(n) = 1. The e constants of integration appear as the e + 1 parameters 
of restricted by 6(n) = 1.T 

Expansion of the integrand in (24) according to (13) yields the form 


/ gs(n, E)A(E) dé s= 0, 1, eee 
I 


Jr (E — n)y.(n — 1, 


for the coefficients of G(x), (12). 
If we denote by 


L,G(r) = G(x) = (26) 
s=() 


the result of applying the operator L; to any solution (24) of (11), then 
it is straightforward that 


lg, (27) 


Grj(x) == | ( 


2ri Jr (— — n)g.(n — 1, é&) 

* The prime denotes differentiation with respect to €. 

+ If G(x) of (24) is to satisfy (11) it is sufficient that @(£) be any function regular 
within (and on) P and that @(n) = 1, as may be easily verified. Since G(z) depends 
on 6(é) only by way of the values of 6(€) at the zeros of the denominator in (24), 
the condition that @(£) be a polynomial of formal degree e serves merely to deter- 
mine 6() uniquely for a given solution G(z). 


3 
AS 
: 
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and that 


Jr (E — — 1, 


(An interesting reciprocity v;,, = vs,; is apparent from (28). In an e-code 
one has (number of j-neighbors of weight s) = (number of s-neighbors 
of weight 7) only for 0 S s,7 S e, since L,G(x) is the generating fune- 
tion for j-neighbors only if 0 S 7 S e.) 


= 


IV. BOUNDARY CONDITIONS 
The coefficients », , (25), of any solution of (11) satisfy the relation: 


+H = dé = 1 (29) 


2miJr&é—n 


by virtue of (16) and the normalizing condition 6(n) = 
With y an integer such that 0 S y S e, denote by 


G™ (x) = », 
s=0 


a solution of (11) which satisfies the e boundary conditions 
) 


? 1 in such a solution, from (29). Thus the condi- 


We must have pv, 
tions (31) are equivalent to specifying the values of G(x) and its first 
e — 1 derivatives at the ordinary point « = 0 of (11), so that such a 
solution G‘” (x) exists and is uniquely determined.° 

Given an e-code on n places, each n-word of B,, lies at distance e or 
less from exactly one code word; namely, the code word to which it 
belongs. In particular, the n-word 00---0 must lie at distance e or less 
from a single code word. That is to say, there is exactly one code word 
in the sphere of radius e centered at 00---0. If this code word is of 
weight y, then the generating function for the given e-code can be none 
other than the solution G(x) of (11) defined in the preceding para- 
graph. 

If there exists an e-code on n places in which the code word of least 
weight is of weight y, then there can be derived from it an e-code on n 
places in which the code word of least weight is of weight y’, where y’ 
is any integer satisfying 0 < y’ S e. The transformation is that of choos- 
ing certain places among n and then changing the letters of each n-word 
of B, in these places, 0’s to 1’s and 1’s to 0’s. (Such a transformation 


9 
: 
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corresponds to one of the operations of the orthogonal group which 
leaves invariant the n-cube representing B, .) Metric properties in B, 
are invariant under such a transformation, clearly, and an e-code is 
transformed into an e-code. Thus if there exists any e-code on n places 
then (11) must have e + 1 distinct polynomial solutions G‘” (x), satis- 
fying boundary conditions (31) for each case y = 0, 1, «++, e. 

In (25) for the coefficients v, , move contour I out to a circle sufficiently 
large that the expansion 


1 e! (const.) 


converges on I’, Suppose that the polynomial 6(£), of formal degree e, is 
of actual degree f: 6(é) = c&’ + 0(¢'), c ¥ 0, where 0 S f < e. Then 
the numerator of the integrand in (25) is of the form: (2*ct**’/s!) + 
and it is clear that 


Vs OSssSe-f-1 


Hence, if 6°”’(é) denotes the polynomial which gives G‘” (x) in the repre- 
sentation (24), then 6°” (£) must be of actual degree e — y. 

A particularly simple case is the one y = e; the polynomial 0‘ () 
must be of degree zero, and is determined by the normalization as 
6 (£) = 1. Thus 


(1 + x) dé 


2ri Jr (E — — 1, €) 


Irom this we have immediately the following 

Theorem: If there exists an e-code on n places then the equation 
g(n — 1, €) = Oin é has e distinct integer roots. 

Proof: If there exists an e-code on n places, then there exists an e-code 
on n places in which the code word of least weight is of weight e. The 
solution (32) of (11) must be the generating function for this e-code; 
hence (32) must reduce to a polynomial of formal degree n. If ¢g.(n — 1, &) 
had multiple roots then noncancelling logarithmic terms (22) would 
appear in the G°?(x) of (32). Thus ¢.(n — 1, &) must have e distinct 
roots §s, 1 < 8 < e. Each solution (1 + x)*(1 — x)""® of the homo- 
geneous equation appears in G(x) with nonvanishing coefficient: 


(Es — n)ee’(n — 1, &) 


| 
| Z 
(32 
G(x 
he 
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Since G(x) must be a polynomial in x, it must be expressible as a 
polynomial in 1 + x; hence each root £; must be an integer.* (It is not 
necessary to require further that 0 < &  < n, since it follows easily from 
(14) that any real root of ¢.(n — 1, £) satisfies0 < & < n — 1 provided 
n and e are integers such that 0 S e < n.) 

As a corollary we have that if e is odd then n must be odd. This follows 
from the theorem and the fact that }(n — 1) isa root of ¢.(n — 1, &) 
when e is odd, from (17). 

We consider next the case y = 0. If 00---0 is a code word, then its 
e-neighbors are the n-words of weight e. Furthermore, the n-words of 
weight less than e belong to the code word 00---0, and can be e-neigh- 
bors neither of 00---0 nor of any other code word. Hence it must be 
true that 


= (”) + 


With G,° (x) represented in the form (27), divide the factor ¢.(n, £)0 (&) 
in the numerator by the denominator; the result will be 


gn, = [(E — nen — 1, + r(&) 34) 


with quotient g(é) a polynomial of degree e — 1 and remainder r(é) a 
polynomial of formal degree e. The term involving g(£) obviously con- 
tributes nothing to G,°(x) in (27), so that from (33) and arguments 
similar to those giving G(x), above, r(£) must be the constant 


r(é) = (") = ¢.(n, n) 


From (34) we then obtain the values of 6°(é) at the poles of the inte- 
grand in (24), and thus 


G (zx) (1 + ay ¢e(n, n)(1 + a) 


ge(n 1,n) ge(n, (Es n)ge' (n 1, &3) 
Before obtaining G(x) explicitly for intermediate values of 7; we 
must first discuss a certain set of recursion relations holding between the 
coefficients v, of any solution of (11). These relations are 


e+p 


+s an 
p = 0, p , (36) 
s=e—pt+l 
* The condition of Golay for the existence of group codes, obtained by different 
means, is essentially that ¢.(m — 1, €) have at least one root an integer. Cf.: (4 
of Reference 4, in view of (16), above. 


4 ‘ 
q 
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where we define », = 0 for s < 0 and where the coefficients k,,, are 


(The derivation of (36) is given in Appendix A.) Equations (36), written 
out, are of the form 


ky, hy, = 0 


ke, e—1Ve—-1 — N2, Ve + ke, e+1¥e41 — ke, = 0 


from which we see that (36) may be used to determine 


Ve+l Vei2 


recursively in terms of 


We see also that if 
Ve = Vey = = = 
(with y such that 0 S y S e — 1) then 
Vert = Vexs = Mey = 0. 


This has the following interpretation in terms of e-codes. It is well known 
(and obvious) that two different code words in an e-code must be sepa- 
rated by distance at least 2e + 1. Hence if the code word of least weight 
in an e-code is of weight y then all other code words are of weight not 
less than 2e + 1 — y. In the generating function for such a code it must 
be the case that not only 


G(x) = 27 + 
but in fact 
G(x) = + (38) 
Equations (36) insure that this condition is satisfied automatically.* 
As a particular case of (38), we have 


G (2) = 1 + O(2""). 


We see that if we apply the operator L, to G(x) there will result 


L,G (x) = + (39) 


* It is also necessary for the existence of an e-code that (86) determine v.41 , 
NON-negative integers when Ve, Ve-1, *** , vo are those of (31). This 
condition is discussed a little further in Appendix A, 


hy 
| 
re 
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using the differential operator form for L,, (9). On the other hand, the 
function 


L,G (x) (1 + a)*(1 d (40) 

is a solution of differential equation (11), in view of the discussion fol- 
lowing (24). From (39) we see that this’ function can be none other 
than G'” (x). Finally, applying L, to G(x) in the form (35), we have 
explicitly 


gy(n, &)(1 + x)**(1 — x)" 
¢e(n, &) (Es — n)ge (n — 1, &) 


V. EXAMPLES 


The known cases where the condition of Hamming is satisfied are the 
following: 

Case I: e = 2 1 

The Hamming expression (1) reduces to unity. In fact, 


¢o(n a, 1, 


and the condition that all roots be integers is vacuous. The generating 
function for code words is (uniquely): 
n 
(0 1 xv 
(x) ( + ) 


= (1 + 2)” 


¢go(n — 


Kach n-word of B,, is a detection region and thus a code word. There is 
no error correction. 

Case II: e 2 1,n =e 

The Hamming expression becomes the sum of all the terms in the bi- 
nomial expansion of (1 + 1)". The “codes” in this class consist of a single 
code word surrounded by its detection region consisting of the sphere 
B,, of radius n. No signalling is possible, of course, but our methods still 
apply. 

From the representation 


g.(n, &) ets (i — 2) dx 


gst 


Cc 


(1_ + 2v)§ 
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(valid for all n, &) we have immediately 


9g” 


gn(n — 1, &) = 2" = 
n! 
and the roots are 0, 1, ++. , — 1. The generating function G“° (x) of 

(32) becomes* 


+2) =2" 


as one might expect. The explicit form for ¢,(n, &) is somewhat com- 
plicated, but for € an integer it follows immediately from definition (13) 
that 


en(n, = (—1)"* 


From (35), then, 


G(x) 


which, again, is not surprising. The details for other values of y seem to 
, 

be more tedious, although one expects (41) to yield G7” (2) = 2’. 

Case IIT: e = 1,n = 2e+ 1 


The Hamming expression in this case: 


consists of the first half of the terms in the binomial expansion of 
(1 + 1)**'. The code words in a code of this class are any two n-words 
separated by distance n (i.e., two vertices at opposite corners of the 
n-cube). The group codes in this class are the “majority rule’ codes. t 
From (42) we have (using the substitution y = 4v + 4°) 


*().=el (‘) denotes the descending factorial 
s 


+t The two code words in such a code are 00 --- 0 and 11 +--+ 1. An n-word be- 
longs to 00 +++ Oif it contains more 0’s than 1’s, and to 11 -+- Lif it contains more 
1’s than 0’s, 
q 


= @, 1, 
= 
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2” (1 + dy 
ex(n, = 


and, without difficulty, 


9° 
g(2e,é) = 2°(* = — — 3) (€ — 2e +1) 


The roots are 1, 3, --- , 2e — 1, and from (32): 


2e+1—€ 
G (x) + 2) (1 x) dé 


2° Jr (E — 2e — IE — IE — 3) --- 2 +1) 
2*(1 + x)(1 + 2)? (I 


In the case y = 0 we need the result 


ge(2e + 1, = 2 | 


from (43). It is then tedious but straightforward to obtain from (35) 


G(x) = > ') — 
= 


2r +1 
1 + are 
One expects to get 
= 2” + 
from (41), but verifies a appears to be complicated. 

Case IV: e = 1,n = 2'— 1 (¢ = 3,4, ---) 

The single error pt codes of Hamming’ are included here. 
(The examples for ¢ = 1, resp. ¢ = 2, appear under Case II, resp. Case 
III, above.) Since n is always odd the condition that ¢i(m — 1, é 
2& — n + 1 have an integer root is automatically satisfied. For y = 1 
the generating function is 


_ (i — x)*( x)” 


l+n 


from which we have 


(—1) s even 


: 
= 

| 
Vs 
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( 1 
1 {(n — 1) 
— (? s odd 
1+n\\s — 1) 
For y = 0, Eq. (35) works out as 


/ 


so that 


4(s+1) s odd 
\\s a(s — 1)/) 


Case V: e = 2,n = 90 
The double error correcting codes for n = 2, 5 are covered by Cases 
II, ILI, respectively. The discovery that 


1 + 90 + 4(90)(89) = 2” 
is due to Golay.’ We have 
2eo(n — 1, = (2E— n+ — (n — 1) 
with roots 
— 14 (n — 1)} 


Since these roots are not integers when n = 90, there can be no 2-code 
for n = 90.* H.S. Shapiro has shown (in unpublished work) that the 
Hamming condition for e = 2 is satisfied only in the cases n = 2, 5, 90, 
so that the only nontrivial 2-codes are those equivalent to the majority 
rule code on 5 places. 

Case VI: e = 3,7 

Golay’ finds: 


1 + 23 + 3(23)(22) + (23)(22)(21)/6 


and gives explicitly a 3-code on 23 places of group type. We have 
— 1,&) = (2 — n + 1)[(2E — n + 1)° — (3n — 5)| 
and when » 23 we verify that the roots are the integers 7, 11, 15. 
‘ 1 a 
Computations by the author show that for x < 10" the Hamming con- 
dition for e = 3 holds only when n = 3, 7, 25. 


* This settles a question raised by Golay, who shows that there is no code of 
group type in this case, but not that there is no code at all. 


n ( 1)** 2 seven 
| 
| 
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For e = 4 we have 
— 1, = — n + 1)° — (3n — 7)f — — 30n + 40) 
For n = 4, 9 this reduces to the forms given under Cases II, III. Pre- 
liminary calculations by the author shows that any other solutions of 
the Hamming condition fore = 4 must be such that » > 10", so that 
the question of the existence of 4-codes (other than the majority rule 
code) is somewhat academic. 

Computations of Mrs. G. Rowe of the Mathematical Research De- 
partment show that Cases I-VI cover all cases of the Hamming condition 
being satisfied in the range 


eS N, lsn 190 


APPENDIX A 


From (13) we have 


( ¢s(n, (Al) 


(1 +2) 


Applying the operator D = —(1 + x)*d/dx to both sides of (Al) p 
times, there results 


8s=0 


The substitution » = (1 + x) * reduces D to d/dv, so that 


dv? 


o 


a=0 


= (—1) 4,3 (1 + x) 


using Leibnitz’s rule. We substitute this into Eq. (A2), multiply both 


+ x)" — v)*p" 


sides of the result by 
(1+ x)" — x)? 


(with 7 arbitrary), and then equate coefficients of 2 on both sides; 
there obtains 


t+p 
n—é 
2 ( — 7, &) = alm, 73 
p s=0 


| 
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valid for all n, &, 7 and all non-negative integers p, t, where 


The coefficients kp,.(n, 7; ¢) vanish unless s S ¢ + p; if n and p — + 
are non-negative integers then the coefficients x),.(n, 7; 2) are positive 
integers provided t — p + 7 S s and vanish otherwise. In particular, 
(setting r = 1, ¢t = e), 


p s=e—p+l1 

where we define ¢,(n, £) = 0 for s < 0; the ky. = kp.s(m, 1; e) are 

those of (37) of the text. If we multiply », of (25) by (—1)***k,.. 

and sum on s there results (36), since the left hand member of (A5) is 

a polynomial multiple of the denominator of the integrand in (25). 

If the code word of least weight in an e-code is of weight y, then the 
first nontrivial one of the (37) is the one for p = e + 1 — y, and it 
gives (since v,‘” = 1) 

+1—y 


A necessary condition for the existence of an e-code on n places is that 
this expression be a non-negative integer in each case y = 0, 1, ---, @. 
It is not clear, however, that this condition is independent of the one 
set forth in the theorem of Section IV. 


APPENDIX B 


We give here a relation due to K. M. Case® which shows that the 
statement of our main result as it appears in the Abstract heading this 
article agrees with the theorem proved in Section IV. 

In the defining relation 


(1+ 2)'(1 — = 7) (B1) 


s=0 


for the coefficients ¢,(n, r) assume that n and r are integers, multiply 


n 
both sides by (—1)’ (") y, and sum on r, 0 < r S n. The result is 


(1 — x) — yi +2)!" = > (”) ya'g.(n,r)  (B2) 


r=0 s=0 


= 
534 
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fi 
| 
hy 


BINARY BLOCK CODING 


Rearrange the left hand member and re-expand it, to get 


— x) — yi + 2))" = y) — + y))" 


— 


—1)* y'¢(n, 8) 


s=0 r=0 
Comparing coefficients of x’y" in (B2) and (B3), we have, finally, 


n 


(—1)’ g:(n, r) = (—1)° 


¢-(n,s) (n,7r,s integers), (B4) 


or, changing notation slightly, 


(B5) 


(with n, e, integers and 0 S n — 1). Thus if ¢.(n — 1, vanishes 
for e different integers & then so must ¢ge(m — 1, e), at least when e < n. 
But ¢:(n — 1, e) is the coefficient of x* in (1 + x)‘(1 — x)" '* when this 
is written out as a polynomial in 2, by definition. 
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Selecting the Best One of Several 
Binomial Populations 


By MILTON SOBEL and MARILYN J. HUYETT 


(Manuscript received July 17, 1956) 


Tables have been prepared for use in any experiment designed to selec 
that particular one of k binomial processes or populations with the highest 
(long time) yteld or the highest probability of success. Before experimenta- 
tion the experimenter chooses two constants d* and P* (0 < d* S 1;0 8 
P* < 1) and specifies that he would like to guarantee a probability of at 
least P* of a correct selection whenever the true difference between the long- 
time yields associated with the best and the second best processes is at least 
d*. The tables show the smallest number of units required per process to be 
put on test to satisfy this specification. Separate tables are given for k = 2, 
3, 4. and 10. Each table gives the result for d* = 0.05 (0.05) 0.50 and for 
P* = 0.50, 0.60, 0.70, 0.75, 0.80, 0.85, 0.90, 0.95, and 0.99. For values of 
d* and P* not considered in the tables, graphs are given on which inter- 
polation can be carried out. Graphs have also been constructed to make 
possible an interpolation or extrapolation for other values of k. An alterna- 
live specification is given for use when the experimenter has some a priori 
knowledge of the processes and their probabilities of success. This specifica- 
tion is then compared with the original specification. Applications of these 
tables to different types of problems are considered. 


INTRODUCTION AND SUMMARY 


A frequently encountered problem is that of selecting the “best” 
one of k (k = 2) processes or populations on the basis of the same num- 
ber n of observations from each process. We shall assume that the given 
processes are all binomial or ‘‘go — no go”’ processes and that the best 
process is the one with the highest probability of obtaining a “success” 
on a single observation. We shall consider a single sample or nonse- 
quential procedure which means that the common number n of observa- 
tions from each process is to be determined before experimentation 
starts. The corresponding sequential problem is being investigated.’ 
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Briefly, the technique employed here is to let the experimenter decide 
how ‘‘close”’ the best and second best processes can be before he is willing 
to relax his control on the probability of a correct selection. The selection 
of a best process will, of course, be made on the basis of the largest 


observed frequency of “success”; the only remaining problem is to 
determine the value of n. Tables and graphs which cover almost all 
practical problems in this framework are given for determining the re- 
quired value of n. In particular, tables and graphs are given for k = 2, 
3, 4 and 10. Graphs are also given to approximate the result for any value 
of k up to 100. 

This problem arises in many widely different fields of endeavor; we 
shall briefly consider two industrial applications. One application of the 
binomial problem is to comparative yield studies. Here success corre- 
sponds to the making of a good unit, and the goal is to select the process 
with the highest (long-time) yield. Another application of the binomial 
problem is to comparative life testing studies. In this case the experi- 
menter selects a fixed time 7’ and defines the best process as the one for 
which the probability of any one unit surviving this time 7’ is highest. 
Then, of course, a successful unit is one which survives the time 7’. In 
treating this as a binomial we are discarding the information contained 
in the exact times of failure. In many cases the times of failure are either 
unknown or very inexact; in other cases it is not known how to utilize 
the knowledge of the exact times of failure. Hence, it would be valuable 
to know the results for the more basic binomial problem. The time 7 is 
considered fixed throughout; its value is determined by non-statistical 
considerations. The specification and the final decision of the experi- 
menter all refer to this predetermined time 7’. It should be noted that 
the experimenter cannot use information obtained from the continuation 
of the test beyond time 7 since the best process for 7 is not necessarily 
the best process for a longer time, say 107’. This binomial type of analysis 
has the advantage that it does not assume any particular form of the 
life distribution. In particular, the assumption of exponential life is 
avoided. 

The presence of a priori information changes the number of observa- 
tions required. An alternative specification is given which is justified by 
certain 4 priori information based on past experimentation. The amount 
of saving is briefly examined. This area of utilizing 4 priori information 
to reduce the number of observations required should be investigated 
further. 

The treatment of the problem in this paper is based on the assump- 
tion that, after experimentation is carried out, the experimenter must 
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choose one of the / processes and assert that it is best. If he allows him- 
self the possibility of hedging and asserting that he needs further ex- 
perimentation, then the problem changes and the tables of this paper are 
not appropriate. 

The following additional assumptions will be made: 

1. Observations from the same or different processes are independent. 

2. Observations from the same process have a common fixed proba- 
bility of ‘‘success”’. 

3. There is no chance of error in determining whether a success or a 
failure has occurred. 


The assumption of a common probability fixed once and for all for 
each process is one that should be checked carefully in any practical 


application of the results in this paper. Roughly speaking, this assump- 
tion states that each of the processes is in a state of statistical control 
as far as the probability of success is concerned. 

We shall consider only the case in which the same number n of obser- 
vations are taken from each process. This is certainly reasonable for a 
single sample procedure if no 4 priori information is assumed. 


STATISTICAL FORMULATION OF THE PROBLEM 


Each of k given binomial populations II; is associated with a fixed 
probability of success p; where 0 S p; 1 (7 = 1, 2, --- , For ex- 
ample, in the yield problem p; is the long-time yield for process II; or 
the probability that any one unit from IT; is a good one. Let the ordered 
values of the p; be denoted by 

Puy 2 Pa Pw (1) 
No & priori information is assumed about the values of the p,; or about 
the correspondence between the ordered p;;; and the k identifiable popu- 
lations II;. In particular, we have no idea before experimentation 
starts whether py) is associated with Tl, , Ty, --- , or Tk. 

The problem is to select the population associated with py; on the 
basis of n observations from each population. If there are ¢ ties for first 
place, say 

Paul = Pe = = Pia > (2) 


then we shall certainly be content with the selection of any one of the 
associated ¢ populations as the best one. 

As an index of the true difference (or distance) between the best and 
second best populations we introduce the symbol 


d = py — 


= 
‘ 
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It is assumed that if the difference d between the best and second best 
populations is small enough, then the error involved in wrongly select- 
ing the second best process as the best one is an error of little or no con- 
sequence. The experimenter is therefore asked to specify two quantities 
which will determine the number n of observations he is required to 
take from each process. 


Specification: He specifies the smallest value d* 
(0 < d* S 1) of d for which it would be economically 
desirable to make the correct selection. He also specifies 
a probability P* (0 S P* < 1) of making a correct 
selection that he would like to guarantee whenever the 
true difference d = d*. 


Letting Pes = Pes (pay, *** , Puy) denote the probability of a correct 
selection we can now rewrite the specification that the experimenter 
wants to satisfy in the simple form 


Poa tor (5) 


[The word “specification” will be used below to denote the specified 
pair of constants (d*, P*) as well as the condition (5); it will be clear 
from the text which is meant.] Since the final selection is to be made on 
the basis of the observed frequency of success, the essential problem is 
to find the number n of observations required per process to satisfy the 
specification (5). 

The possibility that d may be less than d* is not being overlooked. 
The region d < d* is being regarded as a zone of indifference in the 
sense that if d < d*, then we do not care which process is selected as best 
so long as its p-value is within d* of the highest p-value py) . For values 
of P* < 1/k no tables are needed since a probability of 1/k can be at- 
tained by chance alone. 

Some comments on the above approach and on a possible modifica- 
tion have been placed in Appendix I in order to preserve the conti- 
nuity of the paper. 


CONFIDENCE STATEMENT 


After the experiment is completed and the selection of a best process 
is made, the experimenter can make a confidence statement with confi- 
dence level P*. Let p, denote the true p-value of the selected population 
and let p, denote the maximum true p-value over all unselected popula- 


pire: 
fe 
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tions. Then the confidence statement, consisting of two sets of inequali- 
ties 


Puy d* Ps Puy = = a* 


| 
| 
| 


S Pu S + lo S Pu — Pr S 


has confidence level P*. It should be noted that the above confidence 
statement is not a statement about the value of any p but is a statement 
about the correctness of the selection made. 


LEAST FAVORABLE CONFIGURATION 


The main idea used in the construction of the tables was that of a 
least favorable configuration. Before defining this concept we shall 
define the set of configurations 


Pay — d = = = = Py (6) 


obtained by letting d in (6) vary over the closed interval (d*, 1) as the 
Less-Favorable set of configurations. It is intuitively clear and will be 
rigorously shown in Appendix II that if our procedure satisfies the 
specification for any true configuration (6) with d = d’ and pu, = pry, 
then it will also satisfy the specification when 


0 0 — 
Py — d = 2 DH (7) 


Of course, we shall be interested particularly in the case in which d 
equals the specified value d*. If d = d* is fixed in (6), then (6) specifies 
the differences between the p-values, but the “location” of the set is still 
not specified. We shall use py; to locate the set of p-values. The proba- 
bility Pes of a correct selection for configurations like (6) with d = d* 
depends not only on d*, n and k but also on the location py, of the 
largest p-value (except for the special case / = 2 and n = 1). [In the 
corresponding problem for selecting the largest population mean of /: 
independent Normal distributions with unit variance,’ this probability 
Pes depends only on the differences and, hence, only on d in the configura- 
tion corresponding to 

When (6) holds with any fixed value of d, the probability Pcs (for any 
fixed n) may be regarded as a function of pa; where d S py; S 1). This 
function is continuous and bounded over a closed interval and therefore 
assumes its minimum value at some point pa) (d) = pr) (dyn) in the 
closed interval (d,1). Fig. 1(b) gives the value of py); (d) as a function 
of d for k = 3 and for n = 1, 2, 4, 10 and «. For any particular value 


| 
Wear 
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of n and for d = d* we shall be particularly interested in the value pj; = 
Pay (d*;n) since this (as shown in Appendix II) gives the smallest 
probability P(s of a correct selection for all the configurations included 
in the statement of the experimenter’s specification. This particular con- 


figuration (6) with d d* and pay piy (which depends on n) is 


called the Least-Favorable Configuration. 

Although the least favorable configuration depends on n, it has been 
empirically found that for n = 10 (and in some cases for n = 4) the least 
favorable configuration is approximately given by pay = 4 (1 + d*) 
in which the two values, p(ij and pis; = pi; — d* are symmetric about }. 
This symmetric configuration clearly does not depend on n. Fig. 1(b) 
shows that as n — ~ the least favorable configuration approaches this 
symmetric configuration (i.e., the straight line marked n = x) quite 
rapidly for any value of d. In Appendix III it is proved that the sym- 
metric configuration is least favorable as n — x. Fig. 1(a) shows for 
k = 3,n = 10, and any value of d the error in Pcs which arises as a result 
of using the symmetric configuration instead of the true least favorable 
configuration. 


EXACT FORMULA FOR N=2 


13d +12-\/25d2 +24d 


p,,,(d) 


0.5 
j ( OR d*) 


Fig. 1 (a) Error in Peg as a result of using the symmetric configuration in- 
stead of the least favorable configuration for k = 3, n = 10, and any common true 
difference d. (b) Least favorable value p.,; (d) of pi.) as a function of the common 
true difference d = py; — ptz},t 2 2, for k = 3 and selected values of n. (for 


d = d*, (d) = 


| 
| 
| | | } 
| 
a 
) | 
n=1 
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x | | 
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CONSTRUCTION OF THE TABLES 


Consider any fixed value of d*. For each of a set of increasing values 
of n the minimum probability Pés of a correct selection for d = d* 
(i.e., the probability for the least favorable configuration) was computed, 
These calculations were then inverted to find the smallest » for which 
the Pés is greater than or equal to the specified value P*. Tables I 
through IV give the smallest value of n for k = 2, 3, 4, and 10, for d* = 
0.05 (0.05) 0.50, and for selected values of P*. Graphs corresponding to 
these tables are given in Figs. 2 through 5. 

For small values of n (say, n < 10) it was necessary to approximate 
pin) by calculating the Pes exactly for several values of pu; and proceed- 
ing in the direction of the minimum probability Pés. For the special 
case n = 2 and k = 3 an explicit formula for pj(jy is given on Fig. 1. 

For large values of n (say, n > 10) the Pés was calculated by assum- 
ing the symmetric configuration. Here it was necessary to make use of 
the normal approximation to the binomial. Fortunately the appropriate 
table needed in this normal approximation is already published.” The 
proof that this table is appropriate is given in Appendix ILI. The result- 
ing value of n is given by 
B 


= 


where the constant B, depending on P* and k, is equal to 10° and C is the 
entry in the appropriate column of Table I of R. E. Bechhofer’s paper.” 
A short table of B values, Table V (see page 550), is included in this 
paper to make it self-contained. 

The middle expression in (8) will be referred to as the normal ap- 
proximation and the right hand expression in (8) will be referred to as 
the “straight line’ approximation. In many cases it has been empiri- 
cally found that these two expressions give close lower and upper bounds 
to the true value. Thus by noting the curves drawn in Figs. 4 and 6 for 
k = 4, P* => 0.75 it appears that for all values of d* the true Pés is 
between the normal approximation and the straight line approximation. 
Assuming this to be so, it follows that for k = 4, P* = 0.75 the required 
value of n satisfies the inequalities 
E -i-—-d@isns (9) 

2 
where [x] denotes the smallest integer greater than or equal to the en- 
closed quantity x. This result (9) is empirical and not based on any 
mathematically proven inequalities. It is used here only to estimate the 
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Tasie | Number or Units Required Process TO GUARANTER 
\ PROBABILITY OF P?* OF SELECTING THE Best oF k BINOMIAL Proc- 
ESSES WHEN THE TRUE DIFFERENCE — Pry IS AT Least d*, 
(k = 2) 

The three values in each group are: (1) Normal approximation, ( 

Straight line approximation, and (3) Smallest integer required. 


P* 


0.50} 0.60 0.80 0.85 0.90 | 0.95 | 0.99 


0.05 | 0 | 12.81 | 90.77 141.30 | 214.29 | 327.66 539.77 | 1079.70 
12.84 | 90.99 | 141.66 | 214.83 | 328.48 | 541.12 | 1082.41 
14 


| 0 142 | 215 541 1082 
0.10 | 0 18 | 22.5% | 35.06 $1.30 133.93 267 .90 
5: 32.12 135.28 60 
135 


15.03 

15 
9.69 
04 


10 
3.46 


10.65+ 
).68 13.36 
7 13 


0.50/0) 0.10 | 0. 06 8.12 
0.1: 4 AL 10.82 
5 10 


| 
35.6! 0.12 | 9 
39 23.33 36.50 60 
= 15.39 | 23.87 37 4.94 
39 | 9.88 | 15.7 | 24 32.47 67.65 
“0.15 |0| | 1011 16 | 19.71 
0.15 | 0} 1.43 11 | 12.89 | 20.53 34 
0) 2 | 8.50 13.43 | 40.59 
0 | | 5.69 | 9 5 6 | «(12.32 21.64 42 
0.20 0| 0.8 6 8.06 13.14 
4 5.31 | 8.59 14 — 27.36 
48 | 3.4 | 5.67 | 9 — 13.6: 30.0 
(0.25101 0 ‘51 | 3.64 | 8.30 29 
0; 1 —_— 3.58 | 5.97 | 9 19.38 
10 1 2.54 | 4.38 | 14.21 
| 23 | 1.63 2.89 5 16.91 
0.35 0 0.2 2 —— 2.82 5.13 9 & 
0 1.8 3.5 5 
0.40 | 0 | 0.20 | 1 on 06 = 
0 | 1.39 2.65 4 
3 | 0.90 1.75— 3 
0.45 | 0 | 9 
0.45 | 0 | 2 
a 
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Taste Il —- Number or Units REQUIRED PER Process TO GUARAN- 
TEE A ProBABILITY OF P* OF SELECTING THE Brest oF k BINOMIAL 
PROCESSES WHEN THE TRUE DIFFERENCE ppuy — pre 
= 3) 

The three values in each group are: (1) Normal approximation, (2) 

Straight line approximation, and (3) Smallest integer required. 


Is At LEAST 


| oso | 0.60 0.75 0.80 0.85 | 0.90 0.95 0.99 


0.05 | 30.89 | 78.16, 205.06 | 27 
30.97 | 78.36) 205.58 | 27 
3 79 206 27 


2.36 | 363.06 | 496.14 | 732.63 1305.21 
3.04 | 363.97 | 497.38 734.46 1308.49 
3 | 364 498 735 1308 


7.66 | 19.39) 50.88 | 67.58 | 90.08 | 123.10 | 181.78 | 323.85+4 
7.74 | 19.59) 51.39 | 68.26 | 90.99 | 124.34 | 183.62 | 327.12 
8 20 52 | 69 | 91 125 184 327 


0.15 | 3.36 | 8.51) 22.33 | 29.66 | 39.53 | 54.02 | 79.77 | 142.12 
3.44 | 8.71) 22.84 | 30.34 | 40.44 | 55.26 | 81 145.39 


23 


30 78 


0.35 0.55+) 


1. 6.52 8.91 | 13.15+| 23.43 
| 0.63 | 1.60 4.20 | 5.57 7.43 | 10.15 | 14.99 | 26.70 
2 |2 | 8 6 8 10 15 26 
0.40 | 0.41 | 1.03) 2.70 3.58 1.78 | 6.53 9.64 | 17.17 
0.48 | 1.22 3.21 4.27 5.69 7.77 | 11.48 | 20.45— 
1 2 1 | $6 | 6 | 8 11 20 
0.45) 0.30 | 0.77, 2.02 2.69 | 3.58 | 4.6 
0.38 | 0.97) 2.5 3.37 | 6. 


20 
0 won 2 3 —| 16.; 
4 3) = 17 9 : 
571 3.65 =. 44 
| 5.7 14 — 07 : 
50 | 0. 33.08 


Tarie 


| 


0.50 


0.05 69.85— 


70.02 
71 
0.10 17.3 
17.51 
18 
0.15 | 7.61 
7.78 
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5 
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| 3 
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-NumBer oF Units REQUIRED PER Process TO GUAR- 


0.60 


Straight line approximatic 
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\NTEE A PROBABILITY OF P* OF SELECTING THE Best or k BINOMIAL 
PROCESSES WHEN THE TRUE DIFFERENCE Pi — Pry IS AT LEAST 
d*, (k = 4) 
The three values in each group are : (1) Normal approximation, (2) 
yn, and (3) Smallest inte 
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TaBLe IV 


d*. (k = 10) 


The three values in each group are: (1) Normal approximation, ( 


OF 


SEVERAL BINOMIAL POPULATIO 


P(2) 


Straight line approximation, and (3) Smallest integer required. 


911.154 
2.43 
513 


0.05 


9.83 


0.90 0.95 


1165.49 
1168.41 
1169 


604. 
605. 


887 .54 
889.77 
890 


| 220.22 | 289.18 
222.44 | 292.10 
22: 291 


| 


96.64 


70.10 
73.03 
72 

33 .37 


35.59 


| 43.82 
| 46.74 
| 46 
24.72 | 
24 
15.93 | 20.92 
18.16 | 23.85— 
18 23 


32.46 
32 


11.68 | 15.34 
13.90 | 18.26 


8.90 | 
9 


11.68 
11 


Ns 547 


NUMBER OF UNITs REQUIRED PER PROCESS TO GUARAN- 
TEE A PROBABILITY OF P* OF SELECTING THE 
PROCESSES WHEN THE TRUE DIFFERENCE pry 


Best or k BINOMIAL 
iS AT LEAST 
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Fig. 2. — Number of units n required per process to guarantee a probability 
of P* of selecting the better of two binomial processes when the true difference 
d is at least d*. 
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Fig. 3. — Number of units n required per process to guarantee a probability of 
P* of selecting the best of three binomial processes when the true difference d is 
at least d*. 
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Fig. 4 — Number of units x required per process to guarantee a probability of 
Pfs selecting the best of four bionimal processes when the true difference d is 
at least d*. 


1,00 


200 300 400 


Fig. 5 — Number of units n required per process to guarantee a probability of 
P* of selecting the best of ten binomial processes when the true difference d is 
at least d*. 
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20 30 40 5060 80 100 
n 


Fig. 6 — Bounds for the number of units n required per process to guarantee a 
probability of P* of selecting the best of four binomial processes when the true 
difference d is at least d*. 
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order of magnitude of the error in our large sample calculations. For ex- 
ample, if k = 4, d* = 0.05 and P* = 0.90, then from Table V we find 
that B = 1.5026 and the two expressions in (8) yield 599.54 and 601.04. 
Hence, it would follow from (9) that » is 600 or 601 or 602. Based on an 
investigation of the behavior of these two approximations in the case of 
smaller P* or larger d* values, it is estimated that the true value of n is 
601. Even if the correct value is 600 or 602 the error would be less than } 
of 1 per cent. Fig. 6 illustrates these bounds on the Pes for k = 4, 
P* = 0.50, 0.75 and 0.99. For P* < 0.60 the straight line approxima- 
tion is a closer lower bound than the normal approximation. 

It is estimated that all integer entries in Tables I through IV have an 
error of at most 1 per cent and, in particular, that all entries under 100 
are exact. 


OTHER VALUES OF / 


In addition to the tables and graphs for k = 2, 3, 4 and 10 there are 
also graphs (Figs. 7 through 14) on which interpolation can be carried 
out for /& = 5 through 9 and on which extrapolation can be carried out 
for /& = 11 through 100. By plotting n versus log k (or n versus / on 
semi-log paper) and drawing a straight (dashed) line through the values 
of n fork = 4 and k = 10 we obtain results which are remarkably good 
approximations for / > 10. The solid curve in these figures connects the 
true values obtained for / = 2, 3, 4 and 10. 

For large values of k the theoretical justification for a straight line 
approximation is given in Appendix V. In order to check the accuracy of 
our procedure of drawing the straight line through the values of n 
computed for / = 4 and / = 10, we have chosen two points at k = 101 
for an independent computation of the probability of a correct selection. 
For P* = 0.90, d* = 0.10 and & = 101 the dashed line in Fig. 12 gives n 
as approximately 400. To check this we computed the normal approxi- 
mation to the probability Pés of a correct selection for the least favor- 
able configuration in the form 

Pe. | (2 + h)f(x) dx = —= F™(av/2 + dx (10) 
Vn 


where 


, = 4.02015 in this example) (11) 

f(x) is the normal density and F(x) is its e.d.f. This was computed by a 
method suggested by Salzer, Zucker and Capuano’ and the result was 


oo 
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Fig. 7 — Number of units n required per process to guarantee a probability of 
P* = 0.50 of selecting the best of k binomial processes when the true difference 
Pi] — Pte) 18 at least d*. 
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Fig. 8 — Number of units n required per process to guarantee a probability of 
P* = 0.60 of selecting the best of k binomial processes when the true difference 
— Pte} 18 at least d*. 
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700 
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APPROXIMATION 


Fig. 9 — Number of units n required per process to guarantee a probability of 
P* = 0.75 of selecting the best of k binomial processes when the true difference 
— is at least d*. 
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Fig. 10 — Number of units n required per process to guarantee a probability of 
P* = 0.80 of selecting the best of k binomial processes when the true difference 
— is at least d*. 
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Fic. 11. —— Number of units n required per process to guarantee a probability 
of P* = 0.85 of selecting the best of k binomial processes when the true differ 
ence — is at least d*. 
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Fig. 12 — Number of units n required per process to guarantee a probability of 
P* = 0.90 of selecting the best of k binomial processes when the true difference 
— Pty! is at least d*. 
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Fig. 13 — Number of units n required per process to guarantee a probability of 
P* = 0.95 of selecting the best of k binomial processes when the true difference 


Ptl — Pt.) is at least d*. 
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Fig. 14 — Number of units n required per process to guarantee a probability of 
P* = 0.99 of selecting the best of k binomial processes when the true difference 
— is at least d*. 
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cs & 0.9168 as compared to the value 0.90 in Fig. 12. The expression 
(10) is derived in Appendix IIT. Another check was made at P* = 0.99, 
d* = 0.20 and k = 101. The value of n from Fig. 14 is 162. The value of 
the Pés computed from (10) using Salzer, Zucker and Capuano’ is 
0.9925+. 

Further calculation using (10) yielded the more accurate results 378 
and 154 instead of 400 and 162, respectively, in the above illustrations. 
The error in both cases is less than 6 per cent; for smaller values of k the 
percentage error will, of course, be much less. 

For interpolation the results are estimated to be within 1 per cent 
of the correct value. For example we estimate from Fig. 11 that the re- 
quired value of n for k = 5, P* = 0.85 and d* = 0.05 is 523. This value 
was computed by the normal approximation and found to be 522. 


TIED POPULATIONS 


In computing the tables and graphs it was assumed that if two or more 
populations are tied for first place then one of these is selected by a chance 
device which assigns equal probability to each of them. The experi- 


menter may want to select one of these contenders for first place by 


economic or other considerations. In most practical problems we may 
assume that such a selection is at random as far as the probability of a 
correct selection is concerned. Hence, it appears reasonable to use the 
tables in this paper without any corrections even when the rule for tied 
populations is altered in the manner described above. 

It is interesting to note that in the yield problem the experimenter 
may settle the question of ties for first place by taking more observa- 
tions until the tie is broken. However, in the life-testing problem he may 
not settle ties by letting the test run beyond time 7’ since the best process 
for time 7 is not necessarily the best for a time greater than 7’. 

In some applications when there are two or more populations tied for 
first place, the experimenter may prefer to recommend all these con- 
tenders for first place rather than select one of them by a chance device. 
In this case we shall agree to call the selection a correct one if the recom- 
mended set contains the best population (or, when pu; = py), if the 
recommended set contains at least one of the best populations). Exact 
tables for the procedure so altered have not been computed. However, 
if the value of n is large and this rule for tied populations is used, then 
the experimenter may reduce the tabled values by an amount equal to 
the largest integer contained in 1/d*. For example, using the above rule 
for tied populations for the case k = 2, P* = 0.99, d* = 0.30, the tabled 
value 29 can be reduced by 3 giving the result 26. 
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ALTERNATIVE SPECIFICATION 


If the experimenter has some A priori knowledge about the processes, 
then he will prefer to specify the following three quantities in order to 


determine the number n of observations he is required to take from 
each process. 


Specification: He specifies phy, and phy (0 


< pi S 
pin 


1) in the neighborhood of his estimate of the 
probabilities associated with his processes. He also speci- 
fies a probability P* (0 < P* < 1) that he would like 
to guarantee of making a correct selection whenever the 
true pa) = pty and the true py; < pty. 


TABLE VI NuMBER OF UNITS REQUIRED PER PROCESS TO GUARAN- 
TEE A PROBABILITY OF P* OF SELECTING THE BETTER OF Two Br- 
NOMIAL PROCESSES WHEN THE TRUE poy 2 pty AND THE TRUE 
P21 S Pro) . (ALTERNATIVE SPECIFICATION, k = 2) 


Pt} = = 9.85 


= 0.80 


bd 
= 0.80 


1 

9 
53 
83 
124 
189 
312 
623 


Taste VII — Numser or Units Requirep PER Process TO GuAR- 


ANTEE A PROBABILITY OF P* OF SELECTING THE Best oF Four Br- 
rr * rin 
NOMIAL PROCESSES WHEN THE TRUE pp) 2 pry AND THE TRUE 
Piz) S Pr) . (ALTERNATIVE SPECIFICATION, k = 4) 
= 0.75 = 0.95 Pry =9.90 | = 0.85 
Pio) = 0.60 Pie] = 0.80 | = 0.80 Pie 


} = 0.80 | o} = 0.90 
2] 


7 42 21 
14 8 79 39 
28 168 
35 8 46 211 101 
45 128 
5Y 171 
83 239 


(12) 
= 
p> | 
| Pi) = 0 = 0.95 
0.75 10 2 1 | 
0.380 14 6 | 3 
.99 53 16 10) 
0.99 | 106 25 42 60 Sas ee 
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Again we can rewrite the specification that the experimenter wants 
to satisfy in the simple form 


Pog = P* for pay = pty and py S pty (13) 


Tables VI and VII give the number of observations required per proc- 
ess for several selected triplets of specified constants (pf), pts, P*) 
when k = 2 and k = 4. These results are also given in graphical form 
in Figs. 15 and 16. 
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Fig. 15 — Number of units required per process to guarantee a probability of 
P* of selecting the better of two binomial processes when the true pj:; = pti) and 
* 
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For example, on the basis of past experience the experimenter may 
estimate that the probabilities associated with his k = 4 processes are 
all in the neighborhood of 0.60. This constitutes his 4 priori knowledge. 
He may then decide that he would like to make a correct selection 
with probability P* = 0.85 when the best process has a yield of at least 
75 per cent and all the others have a yield of at most 60 per cent. Enter- 
ing column | of Table VII we find that » = 45 observations per process 
are required, 

It is much more difficult to furnish tables for the alternative specifica- 
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Fig. 16 — Number of units required per process to guarantee a probability of 
P* of selecting the best of four binomial processes when the true pi; 2 pty and 
the true S 
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tion since there is an extra parameter to vary and the appropriate 
tables for the normal approximation are not available. 

In the computation of these probabilities the least favorable con- 
figuration 


Pa = pry and py = pa) = Pay = = Paw (14) 


was used. It follows from Appendix II that if the probability of a correct 
selection is at least P* when (14) holds, then it will also be at least P* 
when 


For small values of n, exact calculations were carried out. A typical 
exact calculation is shown in Appendix IV. The approximations used 
for large n are given in Appendix IIT. 


Fig. 17 — Illustration of the varying zones of indifference and the least favor- 
able configuration for k = 4.and P* = 0.85. (For n = 5 the longest vertical segment 
occurs at the point A, where the abcissa and the ordinate are symmetrical about 
0.5) 
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COMPARISON OF THE TWO SPECIFICATIONS 


It should be pointed out that for a given k the same value of n would 
satisfy the specification for different specified triplets 


* 
P*, 


For example with / = 4, P* = 0.85 and n fixed we could vary pf in 
the alternative specification and compute for each pf the correspond- 
ing largest value of pf; such that the specification (P*, phy , pty) is 
satisfied. This is shown in Fig. 17 for n = 5, 10, 20 and 60. The vertical 
distance in Fig. 17 between the appropriate curve and the 45° line 
(n = x) is the length of the indifference zone (pty , pj). The indif- 
ference zone widens in the center and narrows at both ends. In fact we 
find just as in the original specification that for n greater than (say) 4 
the indifference zone is widest when pf and pf; are symmetrical about 
0.5. It is clear that the two specifications would coincide if we took d* in 
the original specification and set pty; = 3 (1 + d*), phy = 3 (1 — a*) 


1.00 


Fig. 18 — Probability of a correct selection as a function of the true difference 
d = pti) — pts) under the least favorable configuration for k = 2 and selected 
values of n. 
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in the alternative specification. We shall be interested in comparing the 
alternative specification (P*, pf , pty) with the original specification 
with the same P* and with d* set equal to pty — pty. It is clear that 
the value of n required for the original specification will always be 
larger. 

The original specification is simpler and is preferable to the alternative 
specification when little or nothing is known about the processes on test, 
but the price that has to be paid for ignorance is an increase in the 


| 
12) 0.05 0.10 015 020 025 030 035 040 045 50 
d 


Fig. 19 — Probability of a correct selection as a function of the true difference 
d = pti) — pte) under the least favorable configuration for k = 3 and selected 
values of n. 
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required number of observations. In the example of the preceding sec- 
tion the value of n required for the alternative specification is 45 as 
compared to 51 observations per process required to satisfy the original 
specification with the same P* and with d* = pi; — pf). Here the 
saving is only moderate. The saving will be much larger if pty and 


Fig. 20 — Probability of a correct selection as a function of the true difference 
d = pty — pf) under the least favorable configuration for k = 4 and selected 
of n. 
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Piz; are further from 0.5 and d* is small. For example, for k = 4, P* = 
0.95, pi = 0.95 and pi; = 0.90 the value of n required for the alter- 
native specification is 239 as compared to 850 observations per process 
required to satisfy the original specification with the same P* and with 
d* = 0.05. The alternative specification is justified on the basis of a priori 
or previous information about the approximate values of the p’s. 


REVERSING THE TABLES 


The experimenter may wish to use the tables of this paper in reverse. 
For example, if 7 is fixed and d* is specified by the experimenter, then 
by using the appropriate table he can find the probability of a correct 
selection that is guaranteed for d = d*; i.e., a greatest lower bound to 
the probability of a correct selection for d 2 d*. This process of re- 
versing the given values and the values to be computed can most easily 
be carried out on graphs. For example, the above problem of finding the 
guaranteed probability of a correct selection given d* and n is most easily 
carried out on Figs. 18, 19, and 20. 


APPENDIX 
MODIFICATION OF THE ORIGINAL SPECIFICATION 


The same value of n will, of course, satisfy the specification for dif- 
ferent pairs of specified values (d*, P*). From a purely mathematical 
point of view it is not necessary that d* should be the smallest difference 
for which the experimenter desires to make a correct selection. For ex- 
ample, if k = 3 the experimenter could specify any one of the four pairs 
(0.10, 0.60), (0.25, 0.90), (0.30, 0.95) or (0.40, 0.99) and obtain the same 
result, namely n = 20. The experimenter may prefer to specify the curve 
or set of points corresponding to a fixed n. Several such curves are given 
in Figs. 18, 19, and 20 fork = 2, 3, and 4, respectively. The experi- 
menter would decide in advance on some property of the curve that he 
considers desirable and from the appropriate figure he could find the 
curve with the smallest n-value that satisfies the desired property. 

The main point of the above paragraph is to point out that the original 
specification in the body of the paper is one particular way, but not the 
only way, of stating a specification that will determine a value of n. 
The only criterion for a good way to state the specification is that the 
experimenter should be able to bring his best judgment (or best guesses) 
to bear on the quantities that have to be specified in advance. 


: 
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APPENDIX II 


MONOTONICITY PROPERTIES 


We shall prove that for any fixed d (0 S d < 1) the probability Pes 
of a correct selection is smaller for the configuration: 


Pay — = = = = Puy (Al) 


than for any configuration given by 


Puy — d = py 


IIV 


= pu (A2) 


where pi) is considered fixed and the p;,, (¢ = 2) are variables. In other 
words, for fixed py) the probability Ps is a strictly increasing function 
of each of the differences 


Pa) — Pt 


We shall need the following lemma. 
Lemma 1: For any pair of integers x, n (0 S x S n) and any 
6(0 @ <1), not depending on p, the function 


z—1 
p, 0) = > C;"p'(1 — + 0C,"p"(1 — (A3) 


is a decreasing function of p over the unit interval (0 S p S 1). More- 
over, it ts strictly decreasing unless (« = 0 and @ = O) or (x = n and 
6 = 1). 

Proof: Differentiating (A38) with respect to p gives after telescoping 
terms 


(0 — 1)xC,"p" — p)"* — O(n — x)C,"p'(1 — p)"*" (A4) 


which is negative for 0 < p < 1 unless (vw = 0 and 6 = O) or (x = n and 
6 = 1). Since H iscontinuousin p at p = O and p = 1| thelemma follows. 

Let XY, denote the chance number of successes that arises from the 
binomial process associated with 


Pu = 1,2, ---,n) 


the value of the integer ” is assumed to be fixed throughout this discus- 
sion and it will usually not be listed as an argument. The probability 
Pes of a correct selection for any configuration with pay > py) is 
given by the expression on the top of the next page. 


{ 
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k 
Pcs = P{Xw < for i= 2} +4 = and 


a=2 


Xi < Xq fori (A5) 
+7 => X (2 = Xw} 


It will be necessary to write the Pcs for any configuration with pay > pi) 
in another form which is more useful for the purpose at hand. Corre- 
sponding to any binomial chance variable X (which takes on integer 
values from 0 to n) we define a “Continuous Binomial” chance variable 
Y by letting Y be uniformly distributed in the interval (j — 4, 7 + 4) 
with the same total probability in this interval as the ordinary binomial 
assigns to the integer j, namely 


— py"? (j = 0,1, +++, ) 


We will now show that the probability Pcs of a correct selection is unal- 
tered if we replace each of the i discrete binomials by its corresponding 
continuous binomial. Let Y,;) denote the continuous binomial (CB) 
chance variable associated with p;;, and let y;:) denote any value it can 
take on. Let X,;, denote the nearest integer to Y,, and let x;;) denote 
the nearest integer to yi) (¢ = 1, 2, --- , k). Then Xj) is a discrete bi- 
nomial (DB) with the same parameters (p;,; , ). Let 


g(x, p) = C,"p'(1 — (x = 0,1, +++, n) 


Then the density g(y, p) of the continuous binomial (disregarding the 
half-integers) is given by g(y, p) = g(x, p) where x is the nearest integer 
to y. 

For two continuous binomials (i.e., / = 2) the probability Pes of a 
correct selection for any configuration with pj; > py) is given by 


n+1/2 
/ P{Y < 3 pa) dyay (A6) 


—1/2 


Pes(CB) 


n 
P{V < yay}g(ya 3 pur) dya (A7) 


Y2(,)—1/2 
Within any interval (7a) — 4, ra) + 4) we have 
< ym} = P{X < xa} 
+ P{X@ = a < ya | Xe = ra} 
= P{Xq@ < tm} + 3 P{Xe = rq} 


(A8) 


which depends only on xq) . Hence from (A7) 


‘ 
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Pes (CB) 


n (A10) 
= IP{Xe@ < 2m} = = ta} 


The above is easily generalized to hold for any k > 2. The details of this 
generalization are omitted. For general k this equality holds not only 
for the important special case pj) > py; but also for the more general 
case (2) for any ¢ < k. Since the latter result is not needed here, the 
proof is omitted. 

If we let G(y;p) denote the ¢.d.f. of the continuous binomial then 
lemma 1 can be restated in the following form. 

Lemma 2: For any integer n and any y, the function G(y;p) is a non- 
increasing function of p. In particular, for < y <n + it isa 
strictly decreasing function of p. 

Proof: For any y, set « = x(y) and 6 = @(y) equal to the integer part 
and the fractional part of (7 + 3), respectively. Then for any y we have 
the identity in p 


G(y;p) = H(x;p, 9) (Os 


For any yo such that —} < yo <n + 3 we have O S xr(y) S n and 


O Ayo) 1. The inverse function y(7,6) = x + — 3 is a single- 
valued function of the pair (7,8); the two particular pairs (0,0) and 
(n,1) correspond to the unique values y = —} and y = n + 4, re- 
spectively. Hence the pair [x(yo), @(yo)| must be different from these two 
particular pairs above since it corresponds only to yo which is in the 
interior of the interval (—}, » + $). Lemma 2 then follows from lemma 1 
and the fact that G(y;p) is identically zero in p for y < —3 and identi- 
cally one in p for y 2 n + 3. 

The probability Pes of a correct selection for / discrete or / continuous 
binomials for any configuration with pa; > py; can now be written as 


n+1/2 k 
ID PUY < ya} pm) dyay  (A14) 


2 t=2 


n+1/2 k 
= Il g(yspuy) dy. (A15) 


Clearly if any one or more of the pi; (¢ 2 2) decreases, holding py) 
fixed, then it follows from lemma 2 that the right member of (A15) is 


= A135) | 
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strictly increasing, i.e., for fixed pj; the Pcs is a strictly increasing func- 
tion of each of the differences pu; — pry (¢ 2 2) as was to be shown. 
It follows from the above result that in searching for a least favorable 
configuration among all those in which the experimenter wants his 
specification satisfied we may restrict our attention to those of the form 
(Al). Moreover, we may set d in (Al) equal to d* since, for d > d* and 
fixed py , the difference d — d* may be added to each py; (¢ = 2) and 
the probability of a correct selection is increased. Then (A15) reduces to 


n+1/2 
Pes = dy (A16) 

It was shown in the section on the least favorable configuration that 
there is a value pj; of pa; which when substituted in (A16) gives the 
minimum value P45 of Pes . 

We can now prove the following result in which py) is not fixed. For 
any specified pair pf2} < pri; the probability Pcs of a correct selection 
is smaller for the configuration 


= Pia) Pie) Pia Pa (A17) 
than for any configuration given by 
Pu = Pin; = = = Pir) (A18) 


This is shown by considering two separate steps. 

The first step is to increase py holding all the other p’s fixed at phy . 
For any arbitrary set of values of pji; with pu) > py; the probability 
of a correct selection can be written as 


k n+1/2 k 

Pcs = > | II Cups) | — dy 

by adding the probabilities that 
Ya > Yo) > min {Y@, Yuen, Yuan, Ya) 


for 


For 
= * 
Pu > Pe = Pi) = = Pw = 


this reduces to 


n+1/2 
Pes = (k — 1) [1 — dy (A20) 
—1/2 


Aga 
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This result can also be obtained by starting with (A16) with 
Pe) = = Pw = Po) — = pr) 


and integrating by parts. It is clear from (A20) that for fixed p,, (¢ = 2) 
the Pes is an increasing function of py) and is indeed strictly increasing 
for pu in the unit interval. 

The second step is to hold py) fixed and to decrease the values of 
Pt (¢ 2 2). This increases the probability of a correct selection by our 
previous result above. This proves the monotonicity property for the 
alternative specification. 


APPENDIX III 
LARGE SAMPLE THEORY — ORIGINAL SPECIFICATION 


For pay > py) the probability of a correct selection satisfies the in- 
equalities 


P{Xqa > Xw = 2,3, ---,k)} < Pes 
< PiXw 2 Xw = 2,3,---,k)} 


unless pay = | and py; = 0 in which case equality signs hold since the 
three quantities above are all unity. Letting gu; = 1 — pp), we can 
write the left member of (A21) as 


( 
P{Z;> 
+ (pm — + 


where 


= 
(A23) 


For the configuration (Al) with d = d* the chance variables Z; tend to 
normal chance variables V(0,1) with zero mean and unit variance as 
n— x. We have purposely omitted any continuity correction in (A22) 
in order to get a better approximation for the smaller values of n. 

To derive the least favorable configuration for large n we can restrict 
our attention to those configurations given by (Al) with d = d*. The 
quantity pji;, Which minimizes (A22), is obtained by maximizing the 
expression in (A22) 


Q(p) = — p) + (p — a*) (1 — p+ (A24) 
= —2p’ + 2(1 + d*)p — d*(1 + d*) (A25) 


re 
(A22 
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The derivative of Q(p) vanishes at 


= (1 +a*); gi) = 30 d*) (A26) 


which gives the symmetric configuration. Clearly this value of p gives 
to Q(p) its maximum value, 3 (1 — d*°). This proves that the symmetri- 
cal configuration is least favorable in the limit asn — «x. 

Under the configuration (AL) with d = d* andn-— x the distribution 
of the chance variables Z; (¢ = 1, 2, --: , hk — 1) approaches a joint 
multivariate normal distribution with zero means, unit variances and 


correlations given by 


(2.23) = = 1 A2 
° : + (pay — + d*) 


which do not depend on n. For the symmetric configuration this reduces 


to the simple form 


= 3 (A28) 


This is precisely the case which arises in [1] and consequently the tables 
in [1] can be used for our problem when (the answer) n is large. The con- 
stants C = ((P*, i) tabulated in [1] solve the equation 


| 4 
> — = 1,2, 1)| = (A29) 


\ 


for standard normal chance variables Z; satisfying (A28). If we equate 
C/+/2 and the corresponding member of (A22), then we obtain for the 
symmetric configuration 
& * 


v2 / | 
V5 (1 — d*) 


or solving for n and letting B = }C” this yields the large sample normal 
approximation 


(A380) 


(A31) 


Since d* is usually small when n is large and since the solution in (A31) is 
usually somewhat smaller than the true value, then it is of interest to 


examine the simpler approximation 


B 
n 


~ 


(A382) 


which is greater than the result in (A31). This is called the straight line 
approximation since it plots as a straight line on log-log paper as shown 


| 
| 
: 
ve 
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in Figs. 2 through 5. As d* — 0 both the normal approximation and the 
true value are asymptotically equivalent to the straight line approxima- 
tion. 

The normal approximation to the probability of a correct selection 
can also be written in another form similar to (A16) which is actually 
more useful for numerical calculations. The left member of (A21) can 
be written as 


(A33) 
1 11) + (pray Pr) {Wy = wy} 
V Pind 
where 
W;= = 1,2,---,k) (A384) 
np, 

and w; is the same function of xq) as Wis of Xq) . The outside summa- 
tion in (A533) is over the values taken on by w; as xq) runs from 0 to n. 

Asn — © the expression in (A33) approaches 


Il: V + (pry Pw) Vv f (w) deo (435) 
= V prada 
where f(é) is the standard normal density and F(¢) is the standard normal 
c.d.f. For the symmetric configuration, which is least favorable for large 
, (A385) reduces to 
Pp, & | (w + Fi dw (A36) 
—2 


A straightforward integration by parts gives the alternative form 


t=? 


9% 
Pts & (k — 1) 1—F (w F*~(w)f(w) dw (A37) 
. — - 


“3 
which corresponds to (A20). 
A simple method for computing such integrals based on Hermite poly- 
nomials is described by Salzer, Zucker, and Capuano. 


THEORY — ALTERNATIVE SPECIFICATION 


SAMPLE 


LARGE 


The expression corresponding to (A22) for the alternative specification 


is 
(in = Pi) Vn (i = 1,2,---,k—1)> (A38) 


P< Z;> - 


Ae 
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which is already written for the least favorable configuration. The tables” 


are not immediately applicable since the correlations 


* * 

(EB) (i #j) (A839) 
+ 


are not, in general, equal to 5. In the cases treated in Tables VI and VII, 
pr = 0.5 and hence piyghy < ptyqt so that the correlations (A39) 
are all less than 3. It was found that linear interpolation on the required 
value of n between the results for p = 0 and p = 3} gives moderately 


good results when n is large. The result for p = 3 is given by 


(pingiy + pigs) (A40) 


~ 
— 


with \ = 2B where B is given in Table V. The result for p = 0 is given 
by (A40) with X = do Where Ay is the solution of the equation 


P{Z > = (A41) 


which can easily be found from univariate normal probability tables. 
An explicit expression for the result of this linear interpolation is 


pny qn (4B do) + Pi21 


(for pix; 2 0.5) (A42) 
(1) — 


L 
Pes= 


The expressions for the probability of a correct selection for the al- 
ternative specification corresponding to (A386) and (A387) are 


pL, | (aw + b)f(w) dw 


a 


where 


1114 Pi n 
> 0 and b= =O (A45) 
V 


These expressions can also be evaluated by the method described by 
Salzer, Zucker and Capuano.” 


beats 
. 
| 
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APPENDIX IV 

TYPICAL EXACT CALCULATION 
A. Original Specification 

The exact expression (A5) for the probability of a correct selection 
for any configuration simplifies if the configuration is least favorable. 
For any pair of integers (j,n) we define 
bb = P{Xq = j} = O<j< (A46) 
be, =j} = C;"(piy d*)'(qiy +d*)""? O<j (A47) 
Bs; P{X@ Sj} (A48) 


hen the exact probability P¢és of a correct selection for the least 
favorable configuration can be written as 


n k—-1 *k—1 


j=0 i-ol + 2 
where B, _, is defined to be zero. Here, for each value of N4) , the letter 
i denotes the number of processes that tie with VY) for first place and 
for any given value of ¢ the conditional probability of a correct selection 
is L/(1 + 2). Taking & = 4 as a typical case, we can write (A49) more 
explicitly as 


n 3 n n 
= hy; ; l > hij bo; l by; bs; 
j=l 


j=1 2 j=1 


(A50) 
l n 3 
by ibe j 


j=0 
If n = 10 then we may use the symmetric configuration, i.e., we may set 
pit) = 3 (1 + d*), in computing from (A49) or (A50). 
B. Alternative Specification 


The probability Ps of a correct selection for the alternative specifi- 
‘ation is the same as in (A49) and (A50) except that we now define 


= P{Xw = = 1, 2) (A51) 
Bo = P\ Xe J} (A52) 
A typical exact calculation for k = 4, using (A50), (A51) and (A52) with 


* 
Pau = = 90.75 


|. 
— 
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and 


= 
Pi) = Pia) = = 


is given in Table Al. Exact values for the individual and cumulative 
binomial probabilities were obtained from References 4, 5 and 6. 


TABLE AI CALCULATION OF THE Pig 
* * 
Puy = Pry = 079; = = Pts = = 0.60; 


bij 


00000 -O0010 0.00000 00000 

.00003 00157 0.00000 00000 00000 
.00039 01062 00168 0.00011 .00000 00000 
.00309 04247 .01229 0.00180 00008 00015 .00000 
.01622 .11148 .05476 | 0.01243 00139 .00300 00016 
05840 20066 .16624 0.04026 02764 00459 
. 14600 . 25082 .386690 0.06291 .O1578 13462 04939 
. 25028 . 21499 .61772 | 0.04622 00994 B8158 .23571 
.28157 . 12093 .83271 | 0.01462 00177 .69341 
18771 .04031 .95364 0.00162 00007 90943 86727 
.05631 00605 .99395 0.00004 00000 98794 98196 


totals .00000 00000 


10 


>, = 0.44715 


7=1 


3 10 
be; B2;-1 = 0.08498 


10 


> b3,; Bo = 0.01464 


10 


bi; = 0.00145 
4 27=0 


Total = 0.54817 = Pés 


APPENDIX V 
In this appendix it will be shown that for large values of k the value of 
n required to meet any fixed specification (d*, P*) is approximately 
equal to some constant multiple of (In /). 
Let n = n(k) denote the unique positive decimal solution of the equa- 
tion 


+ b Vn) f(w) dw = P* (A53) 


KY 
ae 
) 
) 
0 
0 
2 
= 
a 
: 
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where f(w) and F(w) are defined above, P* and b are known constants 
with 1/k < P* < Land b > 0 and the argument & is a positive integer. 
Let ¢ be a (small) fired number such that 0 < ¢ < Min (P*, 1 — P*), 
Then ¢ < P* —1/k for sufficiently large k. Let A = A(e) be defined by 


1 
| fiw) dw (A54) 
—A 


so that 


Fw +b Vadflw) dw <¢ (455) 


for any integer k = 1, any n > O and any b > 0. Let n’ and n” be the 
unique positive decimal solutions, respectively, of the equations 


| + b Vn’)f(w) dw = P* 


A 
| F(w + b V n’’)f(w) dw = P* 
—A 
where P*, b and k are the same as in (.A53). It follows from (A55), (A56) 
and (A57) that for any integer k = 1 
(A58) 
From (A54) and (A57) we have 


pA 
| (w + bn’ )fa(w) dw = i (A59) 
where f,(w) is the density of the normal distribution, truncated at A 
and —A. The right hand member of (A59) is positive and less than 
unity since < 1 — P*. Hence there exists a with A such that 


A 
+ )falw) dw = + (60) 
—A 


Since wy, is bounded and n” is large for large k we can use the well-known 
approximation 


+ bv/n”) 1 — [—(wy + bn)’, 
V 2r(wa + bn’) 


1) exp | (wa + 
\ + ) 


where only the leading term is considered. Hence from (A59), (A60) 


| 
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and (A61) 


| 


In In 


l—e 


(A62) 
+ bn’) + In (wa + DV nr") 


Since wy is bounded and In V/n” = o(v’”) it follows that for large k 
n” & (2/6) In (k —1)2CInk (A63) 


where C' is a proportionality factor. Starting with (A54) and (A56) the 
same argument gives the same result as (A63) for n’. Hence, by (A58), 
the same result must hold for n. 
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